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Preface 


The present text “Statistical Methods in Education” has been 
written to meet the requirements of the students preparing for 
B. Ed. and B. T. examinations in which statistics is set as а 
subsidiary subject and is a compulsory part of the paper of 
‘Educational Psychology’. The book will be found useful to those 
who are beginners and want to have an Elementary Idea of 
Statistics and Basic Statistical concepts. 

Although ia an exteasive subject of study like statistics, it cis 
quite inevitable that no single text, however bulky it may be, can 
adequately cover up the whole range of the topics required in the 
text. Yet ап attempt has been made in this book to cover the 
courses prescribed for the examinations of B.Ed. and B.T. Many 
of these students have limited knowledge of Mathematics and 80 
the mathematical investigations have, in all cases, been rendered 
as simple and lucid as possible, and have been developed so as to 
direct attention to the statistical aspect of the subject. A number 
of illustrative examples which may readily be performed are fully 
described in each chapter, and numerous questions, mostly selected 
from universities and public examination papers, are appended. 
The present book, I believe, has a large number of tables, graphs, 
diagrams and exercises of all types firstly to satisfy the need of 
thos» for whom it is intended and secondly to illustrate the theory 
amply. 

The reader is expected to be familiar with Arithmatic of 
school levei and a limited knowledge of algebraic symbols. The 
writer has purposely avoided proving the formulae, lest it should 
unnecessarily burden the minds of the students. 

It is particularly important that the reader should not form 
the impression that Statistical Methods contain а series of 
iacomprehensible formulae to be applied indiscriminately to any 
available data. However care should be taken to grasp and 
appreciate the ideas and principles underlying the method learnt. 


Th: whole book is divided into four big chapters namely— 
Classification and Tabulation; Graphs and Diagrams; Measures of 


[У] 


Central Tendency [averages] and Dispersion, Correlation and 
Regression. The diagrams in chapter second and regression in 
chapter fourth are introduced in the text to Bive a better 
understanding to the reader regarding the Basic Statistical Concepts 
and an Elementary Idea of Statistics. 


Author of a text book is always indebted to all the earlier 
books on the subject and therefore my indebtedness to original 
memoirs and all the earlier books on the subject is acknowledged. 
lexpress my manifold deep gratitude to all the publishers and 
writers whose book I have often consulted. 


The author is indebted to all the colleagues for their critical 
comments and valuable suggestions which have undoubtedly 
improved the book. In particular, I consider it my noble duty 
to express my deep gratitude to reverend Gurujee Prof. Mahendra 
Pratap, Head of the Statistics Deptt,, J. V. College, Baraut, and 
Prof. S. P. Singh, Assistant Professor of Statistics, J. V, College, 
Baraut (Meerut) who have given their valuable assistance at all 
stages and who have placed their precious Collections of a few 
problems given in the text, I must, however, here return my 
thanks to Prof. A. C. Sharma and B. 5. Gupta. Assistant Professors 
in B. Ed., D. J. (Post-Graduate) College, Baraut, for their kind 
inspiration to write the text. Finally, my sincere thanks are due 
to Prof. Fauran Singh and H P. Singh, Assistant Professors of 


eir Courteous assistance and 

Ing through the press, 
Inspite of my best efforts, a number of misprints and mistakes 

are likely to have crept in a book of this type. I shall be grateful 


to any notice of these errors and misprints and f 
е ог ап i 
to improve the text, Y Suggestion 


Ist August, 1965. 
Baraut, Verma 


Introduction 


The word ‘Statistics’ is used in two sensenses, : Sometimes it 
is used, as a sigular and sometimes ` s plural: of ‘Statistic’. When 
it is used in the former sense, refers to the subject of Statistics. 
or the theory of Statistics. The later use of the word ‘Statistics’ 
refers to numerical statements of facts." For example, the height of 
individuals may be described by expressions such as 62", 66" etc.; 
the numerical statements of facts. 1f we describe the individuals 
as tall, short, medium etc., the statement is not numerical. 


Statistics as re erred in the former sense in the previous 
paragraph, has been defined differently by different writers. The 
fundamental reason for the differences in these definitions is that 
Statistics has been an ever growing subject. In the beginning, 
Statistics was considered as a state-craft only. In modern times 
it has pervaded almost in all branches. of human knowledge. 
Therefore the former definitions of ‘Statistics’ are not. so 
comprehensive as those given by modern writers. 


Bowley first defined Statistics as “the science of counting and 
the science of averages". Afterwards he gave another definition of 
Statistics as “the science of measurement of social organism 
regarded as a whole in all its manifestations." King defined it as 
“the method of judging collected natural or social phenomena from 
the results obtained by the analysis of an enumeration or collection 
of estimates.” These are some of the definitions given by earlier 
writers on the subject, but none of these definitions correctly refers 
to the present scope of Statistics. 


Statistics is loosely referred to two separate concepts 
(i) descriptive statistics, (ii) statistical methods. Descriptive 
Statistics means the collection and summarization of numerical 
data. The second concept is far more important as it deals with 
the various methods of representing, comparing, analysing and 
interpreting numerical data. Thus Statistics may be defined as 
the scientific exposition of statistical methods. 
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Statistical methods in modern sense range from the mere 
recording and tabulation of numerical data to “subtle process of 
inductive reasoning based on the mathematical theory of probability.” 
The present text deals with the simpler methods which involve 
little more than elementary arithmatic. The most attractive 
developments in technique which have made the science of 
statistics so important to the modern research worker are relatively 
recent ie. mainly in the last forty years. Га industry, statistical 
methods have been used to control the quality of the product. 
This technique is kno vn as Quzlity Control. . Another important 
use of statistical methods is in Agriculture and is known as the 
Design of Experiments. Statistical methods have also been 


abundantly used in Biology, Psychology, Education and Engineering 
in modern times. 


Statistical methods have proved their usefuiness in business 
and science, The need for obtaining statistical data in the smooth 
functioning of any business concerned increases along with size, 
The bigger the undertaking the greater is the need for Statistics. 
No business, large or small public or private cian thrive without 
the aid of Statistics. A manufacturer must know in advance the 
quantity to be produced, the number of workers, the amount of 
raw material etc. i.e. the details of a production plan. Statistical 
data and Statistical methods render valuable services in the proper 
understanding of economic problems. Economic problems are 
always capable of being reduced to numerical facts e.g. volume of 
trade (import and export), output of industries (manufacturing and 
mining etc.), agriculture (wages, livestocks, area and yield of crops 
agricultural prices and forest) etc. These numerical figures are 
affected by innumerable causes; and items vary according to time 
and place Thus the study of economic problems is particularly 
suited te statistical approach. This approach leads to its correct 
description. Five ycar plans have made considerable use of this 
treatment. Apart from economie policy, the development of 
economic theory based on inductive reasoning is also facilitated 
by the use of Statistics. Its importance in the field of MN 
science, and even in technology is in the increase, The pan 
of Statistics is also reflected in the large number of sciences vili 
set a separate paper for study. For this purpose, the Student В 
required to learn the basic ideas and principles of the subject Hi 

it is with such fundamentals that this book is concerned, 2 
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1t should be understood that it cannot be applied to all kinds 
of phenomena. The essential condition for the application of 
statistical methods is that the fzcts should be capable of being 
measured quartitatively. Health, poverty, intelligence are the 
examples where direct rumerical measurements are not possible. 
But the steps are taken to have an indirect measurement such as 
death rate, longevity of life etc. to make a judgement on the health 
of the people Intelligence of students may be compared on the 
basis of marks cb ained by them in a class test. Another defect 
is that statistics attaches ro importance to individual items. 
Therefore it is capable of studying those problems only, where 
group characteristic come in. One more defect of statistics lies 
in the fact that statistical laws are not correct like the mathematical 
laws. By obseiving a limited rumber of items (ie. by taking a 
sample) we infer ¿bout tle characteristic of the. whole population. 
Sometimes stétistics is used to draw wrong conclusions. There 
is an oft quoted sayirg that “there are three kinds of lies—namely 
lies, damned lies спа statistics! | Another saying is that “with 
Statistics an) thing can ђе proved.” If we make an analysis of such 
kinds of misgivings, we are apt to conclude that a number of 
persons exists in this world who is selfish and figures are capable 
of being manipulated. Therefore the useful application of 
Statistics depends upon the ability, skill and special experience 
and honesty of the statistician combined with the unbiasedness 
of the enumerator or investigator. 


¢ Classification and 


sub-divided, Percent 


Measures of centra] te 


ndency (averages). d 
advantages of mean, mode and meg; me 
Computing the averages ; their горе s; 
and their computations; Sures o Pers 
Tange, quartile deviatio > mean деу; 
deviation and their ¢ uta ions; 
variation, Standard err Varia ce, 
curves, 

Chapter ry. (Correlation an egression) 
Meaning of Correlation definition б 
Correlation; degree of Trelation Perfect 
limited degree of ve, Correlati limiteq 
of —ve and Perfect _ Correlation. Casup, 
Correlation ге, Scatter diagram, Pers 5 сост: 


Te abulation) 8 


lagrams Le, si 


Sular diagr 
ral Tendency and р; 


25 у 


correlation, regression line; range of correlation 
coefficient; definition of regression coefficient, 
Tegression lines of Y on X and of Xon Y; relation 
between correlation coefficient and regression 
Coefficients, tests of significance of the regression 


coefficients. ++» 149—184 
Appendix (1). (Tables) : 
Values of ‘t’, у? and Е distributions for 5% and 1% 
probability points. 
Appendix (2), (A. U. Examination Papers) : 


Unsolved papers of Agra University Examinations 
of B. T. & B. Ed. 


^ 


Chapter 1 


| Classification & Tabulation 
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The. raw statistical data which is collected in the course of 
an inquiry, usually consists of a series of readings and measure- 
ments. The original form of the collected data is very complex 
& voluminous and so it is very difficult to draw any inference 
from it as such. Thus, it is necessary to condence the data and 
present it in a systematic way: lt is done by the "Classification , 
& Tabulation” of the data. By these processes, the volume and 
complexity of the data are reduced, which make the analysis 
and interpretations of the data easier. 

Classification: —It is the process of arranging the 
individuals or items into groups or classes according to their 
similarities with respect to some variable character. 

The Advantages of classification:— 

(1) The whole data is divided into a number of classes 
as the items having resemblances with respect to certain 
character аге put together in one class. Thus, it indicates the 
point of similarity and dis-similarity. 

(2) The classification reduces the volume of the data 
which helps in forming the mental picture of the data. 

(3) It brings into light those important informations 
which are liable to be ignored without classifying the data, 

(4) It prepares the ground for comparisons and i nferea- 
ces. Because the classification of the data is not only sufficient 

or the comparisons and interpretations but it also helps in che 
tabulation of the data. Thereafter, the comparisons and the 
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interpretations on the data are possible. 

(5) The classification provides the orderly arrangement 
of the items of the data, 
Basis of classification: 

The classification of the data is based upon the characteri- 
stic possessed by the items contained in the data. There are 
two types of characteristics:— 

(i) Descriptive (ii) Numerical. 

(i) Descriptive-characteristic: Under this type 
of characteristic, the quantitative measurements are not 
possible but only the presence or absence of a particular chara- 
cteristic is observed inthe terms available in the data. For 
example, sex, literacy, unemployment and blindness etc are the 
Descriptive or qualitative characteristic of the items. 


Classification according to Attribute: The 
classification of the given data according to a descriptive chara- 
cteristic is known as the classification according to attribute. 
Here only the presence or absence of a quality (attribute) forms 
the basis of classification 

The classification can further be divided into two 
kinds;— 

(a) Simple classification or classification by dichotomy, 

(b) Manifold classification. 

(a) Simple Classification: The kind of classification 
is termed as Simple classification where only one attribute is 
taken as the criterion of classification. Here, the whole data is 
divided into two classes: — 

(1) Having those items which posses the 

quality ), 

(2) Having those items which do not possess the attribute 
For example; the population of a country can be dinde 
into two classes according to the attribute 


ге. unemployed and employed or accordin 
‘sex’ 1,e. male and female etc. 


attribute 


"Unemployment" 
g to the attribute 
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(b) Manifold Classification. The kind of classifi- 
cation is termed as Manifold classification where more than 
one attribute is taken as the criteria of classification. Here the 
whole data is divided into more than two classes. For example, 
the population of a country can be divided into four classes 
according to two attributes ‘sex’ & ‘literacy’ as shown in the 
diagram given below. 


Population 
у 
LEN (Se: T 
Маје Еетаје 
on i у Y 
} (literacy) 4 y (literacy) 
literate illiterate literate illiterate 


Thus the sub-divisions of the population are— 

(1) literate male (2) Illiterate male (3) literate female 
(4) Illiterate female 
Note:—' А 

In classification according to attribute, it is necessary 
that the classes should be clearly defined before starting the 
actual work of classification i. e. a boundary should be set 
between the classes. As for example, in case of literacy we 
must decide in advance to whom we shall call. the literate 
and to whom the illiterate before starting the work of classi- 
fication of the data. 

(ii) Numerical Characteristic : Under this type of 
characteristic, the quantitative measurements are possible. As 
prices, income, age, yield, height, and weight etc. are the 
examples of numerical characteristic. 

Classification according to Class-Interval . The 
classification of the given data according to a numerical chara- 
cteristic is known as the classification according to the class- 
interval. 1 
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In the classification according to class-interval, we must 
be ascertained before hand whether the data is discontinuous or 
a continuous one. The discrete variables are easier to deal with. 
An example of a discrete variable is the throws of a die. 


Example(1): Let a die be thrown 50-times, and noted 
the number at its upper most face each time. If the number 
obtained are 1, 6, 5, 5, 4, 3, 1, 2, 3, 4, 2, Oh 6, 1 272921 
5, 1, 6, 6, 4, 6, 4, 3, 1, 6, 3, 1, 4, 3, 5, 6, 1, 3, 6, 4, 1, 1, 4, 
3, 5, 3, 1, 2, 4, 4, 4, 2, 5, The variable (X) is the outcome 
of the number at the upper most face of the die in cach throw 
and it clearly assumes the values 1, 2, 3, 4, 5, & 6. 


In order to classify the data, these six values assumed by 
the variable (X) will be written in a horizontal row or a 
vertical column under the heading “Variable Value (Хун 
Since each value of X is repeated a n 
ber of repeatition of a variable value 
In classification, these respective frequencies for each value of 
the variable are written against the values of the variable in 
the 2nd. column under the heading "Frequency f ". As in the 
present example, the variable value <]> is repeated 10 times 
and variable value ‘2’ is repeated 7 times; so the respective 
frequencies for Х—1 & 9 are 10 & 7, 


umber of times; the num- 
is called its frequency (£). 


Frequency Table : The table 
values assumed by the variable and th 
is called a frequency table. 


» Showing the different 
er respective frequencies 


The fre 


ЊИМ quency table for this given example js shown 


e 


Classification & Tabulation 


Table No. 1 


Variable value| Frequency 


(X) (Е) 


| This method is adopted only, when the values taken Ьу 
the variate are not so many (numerous). 

On the other hand, when the number of values taken 
by the variate is large and the range between the greatest. and 
the smallest value is also large, then the entire range is 
| partitioned into classes of appropriate size by drawing the 
| arbitrary lines showing the corresponding frequencies of the 
variate in each class. Each class is defined by two boundaries, 
the Jower boundary and the upper boundary. The lower 
boundary of the class is called the Lower Limit of the class 
and the upper boundary is called the Upper Limit of the 
class. 

Class-Interval: It is the difference between the upper & 
the lower limit of the class. For example, for the class 25—85, 
the upper limit is 35 and lower limit is 25 and the C. I. =86— 


25=10. 
Variate Value : The mid-value of the class is called the 
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} у upper Limit+ lower limit | 
variate value i.e. M.V.— FPP 3 and so in 


the above mentioned class the M.V. = B42 _39 


Important points for frequency distribution : 
While making a frequency distribution, the following points 
should be borne in mind— 

(1) The class-intervals should be equal in each class to 
provide with casy computations. 

(2) The number of classes should not or 
20 and it should not be less than 6. Because for more than 20 
classes, the calculations become heavy and there js not much 
gain in the accuracy. Also, for less than 6 classes, there may be 
a great deal of loss in accuracy. 

(3) In general, there should be no class without definite 
limits as under ‘a’ or over ‘b’. 

(4) We must treat all the values assigned to the different 
classes equal to the mid-value of the classes. Hence the class- 
interval must be chosen in such a way that the average ofall ' 
the items in that class should not deviate much from the mid 
value of the class. : 

(5) A decision should be made before hand whether the 
class‘a—b’contains a lower limit of the class or b, upper limit of 
the class. A note of this should also be given, below the table, 

(6) The class-interval should be an integer as far as 
possible. 

Methods of Classification according to Class-interval:— 

There are two methods of class; i 
class-interval. 

(1) Exclusive Method (2) 

(1) Exclusive Method:— 


the upper limit of any class і 
succeeding class i.e. the class-limits are of the type a-b bec 
c-d. etc. Hence, there js a confusion about those it 
which are exactly equi] to the class-limits (called . p E 
line items ). Usually, the border line item; are sum 


dinarily excced 


Inclusive Method 
In this type of Classification, 
5 the lower limit of the 
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in the classes where they are as the lower limits of the class.As 
variable value *b' is placed in the class b-c. This confusion can 
be removed by having the classes of the type, a and under b, 
b and under c and so on. 


(2) Iuclusive Method:—In this type of classification 
there is a gap between the upper limit of the class and the 
lower limit of the succeeding class. Hence, it leaves no room for 
confusion for the border line items, Thus the classification will 
be of the form a-b, c-d, e-f etc, and the class a—b will involve 
all the items from a to b inclusive a & b within it. This 
method can not be used in the case of a continuous variate. 
Rules for making a frequency distribution: 


(1) Before starting the actual work of classification, a 
preliminary inspection of the data should be made and the 
difference of the highest & the lowest values should be divided 
by the desired number of the class to be formed. It will give 
an approximate idea of the class-in terval and will also help in 
setting the class limit . 


(2) After deciding about the class-interval and class-limits, 
a table containing three headings namely, class-interval, tally 
marks & frequency should be prepared. 


(3) Read for the items in the row table and for each 
items put the mark ‘1’ in its corresponding class in the column 
headed Tally marks. After putting 4 marks of the above type 
the fifth is obtained by crossing the 4 marks and thus forming 
agroup of 5 which helps in counting in the end. 

(4) The sum of the tally marks is written in the column 
of frequency against the respective classes. 

(5) The check of the sum of all the frequencies( xf) must 
show Xf equal to the total number of variate-values. 
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Example (2) 


The marks obtained by 60 students in 
Statistics are given below— 


22 47 9 
38 15 0 
24 22 26 
28 25 17 
26 29 27 


there are as many as 29 values of the 
occuring once, some twice 


at the most, 


data it is obvious that the Tange is 47-04 


42 31 17 13 15 18 


7, because the high- 
est value is 47 and the lowest value 1 i 


does not assume a 


NT 


of 20 shell cases— 
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Table No.2.- 


Class-interval | Tally marks 


Frequency (f ) 


TIT 


| zZzzzZzzzzz£z 


! i 
| Ina similar manner, we can form the frequency table 


(distribution) when the ungrouped data is such 
there Exclusive Me 


to а continuous variate and 
classification will be used. 


Example (3) . 


that it pertains 
thod of 


eats Khe following table gives the Vickers Hardness number 


the form of a frequency distribution ? 


66.3 613 627 604 .602 
645 7665 629 ` 6L5 "678 
"65.0 627 7 622 648  '058 
629 767-5 ' 6T5- 009. 638. Arrange theidata 


in 


» 
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Solution:— 

The largest value is 67:8 and the lowest is 60:2, so the range ` 
is 67:8—60-2— 7-6; it suggests us to start with 60°0 as our 
lower limit and to continue by 1 (unit) class interval. The 
variable is the Hardness Number and clearly it is а continuous 
variate. ! 

The frequency- table із given below— 


Table No 3 


Class-interval | Tally marks | Frequency (f ) 


60—61 ЇЇ 3 
61-62 | ui | 2 
62—63 NT 5 
63— 64 ү! à 
64—65 T v 
65—66 || à 
66—67 | 2 
67—68 ||| 3 | 


Totals 


Cumulative Frequency Table: In some of the 
statistical investigations (Educational tests, wages or salar 
statistics etc.), we require the number of variates which coe 
‘less than’ or ‘more than’ a given value. For this purpose, j N 
necessary to change an ordinary frequency table itio B 


‘cumulative frequency table, It can be done i ; 
зе сап be done in the following Two 
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`(1) Less than type: 
Suppose, we are given the daily expenses for one week 


as follows: — 


Table No. 4 
| Days Ro) 

Monday 4 
Tuesday 6 

еду 10 

Thursday 16 

Friday 12 
Saturday 8 
Sunday 4 


Now if we want to know the total expenses upto 
Wednesday then it will be given by the sum of the expenses 
for Monday, Tuesday `& Wednesday і.е. 4+6+10=20. 
Similarly the total.expenses incurred upto any day of the week 
can be obtained by adding the expenses from Monday upto 
that day. In the tabular form, the total expenses can be 
represented as given below in table No. 5. 


1 


^W 


b: 


ET 


12 
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Table No. 5 
: Days ` ^| Expenses б] the day 

Monday 4 

Tuesday 10 
Wednesday 20 
Thursday 36 
Friday 48 
Saturday 56 
Sunday 60 


On the other hand, the expenses after any day (includ- 


ing the day) will be obtained by adding the expenses for that 
day and after-wards. The following table No. 6 serves this 


purpose. 


Table No. 6 


Expenses after the day 
(Rs.) Ht 
Monday 60 
Tuesday 56 
Wednesday 50 
Thursday 40 
Friday 24 
Saturday 12 
Sunday 


x — M P9 
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It the days are replaced by the variate x, by its values 
SUNY UE ху. then the values in the 2nd. column of table 
No. 5 show the number. "ess than’ the variate value and 
column 2nd. of the table No. 6 shows the number ‘more than’ 
the variate value. 4 : 

_ These frequencies, which are less than or more than the 
variate value (as shown in second column of table No. 5 & 6) 
are called the ‘Cumulative Frequencies. 

The similar procedure is adopted when instead ofthe 
variate value, the classes are given. 

Example No. 4 у 
: Form a cumulative frequency table from the following 

ata: — 
class: 0-3 3-6 6-10 10-12 12-15 15-19 19-20 20-23 23-25 
Frequency:2 4 5 3.3. 11 12 14 10 
Solution:— 

(i) In making the cumulative frequency table of “Less 
than type”, the first class will be replaced by under 3, second 
class by under 6 and so on. The cumulative frequencies will 
be the sums of the frequencies upto that class for all the 
respective classes. The cumulative frequency tabale of “Less 
than type" is given below:— 


Table No. 7 
Variate vali Kad "ber: 
Under 3 2 
Under 6 6 
Under 10 11 
Under 12 18 
Unde 15 — 26 i 
"Under 19 37 
Undér 20 | 49 
Under 23 63 


Under 25 278 
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(2) More than type: 

(8) In foming the cumulative frequency table of “more 
than type”, the first class is replaced by more than 0, second 
by more than 3 and so on. The cumulative frequencies аге the 
sums of frequencies onwards this class for all the respective 
«classes. The cumulative frequency table of “more than type" 
is given below:— 


Table No. 8 
Variate Value aan ee 
More than 0 73 
More than 3 71 
More than 6 67 
More than 10 1' 62 
More than 12 55 
More-than 15 47 
More than’ 19 36 
More than 20 24 
More than 23 10 


Thal TUER EE e ee PR 
Note: (1)— 
From the above example, it is clear ў 
4 , that in t 66 
than type’ c. freq. table, we have to replace {е 
by under’ or ‘below’ the ‘upper limits of the classes? oe 
in the “More than type” el ond 


с. freq. table we 
the classes by ‘More than or over’ the boe ui 
е 
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c lasses’. | 
ш an ordinary freq. table from а c. freq. 
table:— 

There are the cases when we want to change the cumulative 
frequency table into an ordinary frequency table. For example, 
if for every unit of consumption of a certain commodity, its 
total utility is given and from it we want to know the marginal 
utility for every unit of consumption then it will be a case 
of forming an ordinary freq. table from a cumulative one. 
Example (5) 

In the table given below, the total utility fora unit 
consumption ofa certain commodity is given. Find the 
marginal utility for every unit of consumption ? 

Unit : 1 2 3 4.5 67.899 ШОШО 
Total utility: 3 11 21 33 49 63 73 81 98 103 107 108 
Solution:— : 

We know that:— 

the total utility for ith unit—the marginal utility for Ist unit 
Р -++the marginal utility for itè unit, 

From this, it is clear that the relation between the total 
utility and the marginal utility is the sameas between the 
cumulative frequency and the ordinary frequencics. 

Here we have to convert a cumulative frequency table. 
into an ordinary frequency table as shown below:— ' 


Table No. 9 


Unit Unit Маа 
1 3 A 10 
2 8 8 8 
3 10 9 17 
4 12 10 5 
5 |. 16 11 4 
6 14 12 1 


16 , Statistical Methods in Education 
Note: (2)— 


"To form the ordinary frequency table from cumulative 
frequency table oj “less than type” we have— 


frequency for 5" class=c. f for i^. class—c. f. for 
(i—1)™ class. 


Similarly, to convert ac: ў: table of “тоте than type’ 
into an ordinary ferquency table we have / 

frequency for i? class=c. f. for (i)™ class c. f. 
for (i+1)™ class. 


Example (6) 


From the following data, find.the number of persons in 
the clas es 20--25, 25--30...... .. and 55 -60. 
Age тоге than : 20 25 30:35 '40 45 '50 '55 
Мо. of persons: 800 750 680 580 400 250 130 60 


Solution:— 


We have been given that the number of persons who 
are old than 20 is 800 and those who are old than 25 is 750. 
Thus the no. of persons who are between the age of 20 & 25 
is 800-750=50 only. In the same way, the по. of persons 
between 25-30, 30-35 ........ and 55-60 can be found. 
Because the formula used is— 

frequency for i class=c. f. for 1% class—c. f. for 


(11) class. 


Thus the table on the next page shows the no. of persons 
between the different age-groups.— - 
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Tablo No. 10 


Age-group | No. of persons 


20—25 50 
25—30 70 
30—35 100 
85—40 180 
40—45 150 
45—50 120 
50--55 70 
55—60 60 
Total 800 


Example (7) 
Obtain the ordinary frequercy table 


following tables:— 


from each of the 


(1) (2) 
No. of Students No. of Students 
Marks Below 10 3 Marks above 0 30 _ 

> 2.320 8 aa cay ГА 26 
d » 30 17 MO NE SU 21 
m „ 40 20 i "30 14 
D ON) 22 „ » 40 10 

50 9 


Solution:— 
(1) Here the c. f. table of ‘Less than type’ is given and 
we have to convert it into an ordinary frequency table. The 


frequency for it) class—c. Ё. for if" class—c. f. for (i-1)™ 


class. Thus, the table given below shows the desired no- of 
students in the different groups of marks:— 
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Table No. 11 


rks | No. of students 


— 


0—10 
10—20 


30—40 


3 
5 
20—30 9 
3 
40—50 2 


Total 22 


(2) Here the c. Ғ table of 


and wc have to convert it into 


The frequency for ith class= 
class. Thus, the table given 
students in the different group. 


с. Ё for jth ciass 


below shows the desired 
s of marks: 


"More than type” 


is given 
an ordinar 


y frequency table. 
=c. f. for (i41) 


Table No. 12 

<<. 
Marks | No, of students 
ВЕРНЕЕ: 
0—10 4 
10—20 5 
7 

4 


E Tt 


20—30 
30—40 
40—50 10 


[racc ПАКИ 
Total 30 


EE Eco d 
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Tabulation: — 

It has been mentioned already that the classification 
alone is not sufficient for comparisons and inferences. The data 
is further put to the treatment of tabulation which make it 
fit for further analysis and comparisons. 

Thus the T'abulation is the process of arranging the 
classified data in an orderly manner into rows & columns in 
such a way that it makes clear all the important informations 
contained in the data. 

For tabulation, the following points should be kept in 
mind— 

(1) Every table should have a title, explainatory in itself 
and it should provide the informations regarding: — 

(a) What the data are. (b) Where the data are. 

(c) Principle of clasification. (d) Time of data. 

(2) The rows & columns must be arranged in a logical 
ordcr to facilitate the comparisons. 

Theheadings & sub-heading should be concise and 
wi:hout any ambiguity. 

(4) The units of the data presented, should be clear. 

(5) The complicated tables should be avoided and so 
they must be broken up into parts, in more than one simple 
tables. 

(6) The size of the table shonld suit the size of the paper. 

(7) The table should be presented in a neat, clean and 
comprehensive way by drawing the double lines wherever 
rcquired. 

(8) The source of the table should also be written. 

Types of Tabulation 
There are two types of tabulation: — 
(1) Simple tabulation (2) Complex tabulation. 
Simple Tabulation:— 

Ina simple tabulation, the data is presented with respect 
to onc character only or in other words, a simple table is 
capable to provide the information relating to one character 
only. For example, the population of India can be divided 
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according to religion. The table for this purpose will be as 
follows:— 


Table Ne. 13 


Religion | Population 
с=с кА arua t Toe 
1. Hindu 


2. Muslim 
3. Sikh 


4,: Christian 


5. fain 


6. Others 


(2) Complex Tabulation:— 

A complex table gives- the informari 
several characters, This type of tabulation js further classified 
as ‘Double. Tabulation? "Triple ` or 
‘Manifold Tabulation’, | 
Double or two fold Tabulation:: 

n this type of tabulation, the data ig divided according 
to two characteristics. For-exainple,! the Population of India 
сап be divided according to religion: & States'and the table for 
it is given below» 
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Table No. 14 


Population 
| States Total 
| YR | у 
| Hindu [MuslimSik h Cam Jain} others |. 
| SES 4 21 тик 
l l. Assam 
2. Bihar 
3. Punjab 
4. U. P. | 
| 
| Fî —— سا‎ 
ا ا‎ 1 


Treble Tabulation : 


the data is sub-divided accor- 
reble tabulation is capable of 
ally dependent qnestions. The table given 


£ treble tabulation. It shows the distribu- 
religion and 


In this type of tabulation, 
. ding to three characteristics. T 
answering three mutu 


below is an example o 
tion of India’s population according to States, 


sex. 
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Table No. 15 


тип 


Population 


States Hindu |Muslim | Sikh [Others Totals 
T т di 
ziglia 3 azg alz sla 
еВ ojs > g|[gisis[gls S 
مإ اة‎ = ISEB |Р. 
| | 
1. Assam | | 
2. Bihar | | 
3. Punjab || | 
4. U P. | | | 
| | | 
| | 
| | | 
| | | 
Total 
Manifold Tabulation : 


In this type of tabulation. the data is divided with respect 
to more than three characteristics. From this type of table, we 
can get the information relating to several (more than three) 
characteristics. An example of this type of tabulation is the 
division of India’s population according to States, religion, 
sex, literacy and age ete, 

Example (8) 

Draw up in detail, with Proper attention to spacing 
double lines etc. and showing all the sub-totals, a blank table 
in which could be entered the numbers occupied in the a 
industries at two dates ( July 63, July 64) distinguishing males 
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from females and arrong the latter,single, married and vidowed. 
Solution:— 
The desired table is given below:— 


Table No. 16 


No. in July,63 | No. in Јају,64 Totals 


x 
5 - 
8 Female Female Female S 
cM Е ч [е У Ка 

== x о E S32 [s < © 
z = psz £ БЕЧ ИЧ 
E ао ا‎ 
E Hg. <) mz] = 
olo zu f 
a 2/24 | = 
1 
Y | 
3 
4 
| 
5. | 
6. 
Total | 


The above table shows the no. of employees in six indu- 
Stries according to their distribution into sex and two dates. 
Example (9) 

Draw up two independent blank tables, giving rows, 
columns, and totals in each case, summarizing the details about 
the number of families, distinguishing males from females, 
earners from dependents and adults from the children, 
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Solution:— 

The desired table showing all the required informations 
will be а complex table and it will be tedious to understand it. 
For the sake of convenience we partition this table into two 


simple tables: — 
(i) Giving the distribution of all the male members in 


the families according to age and earning; 
(ii) giving the distribution of all the female members in 


the families according to age and earning. 
(1) Table—Showing the distribution. of male members: 


according to age and earning — 


Table No. 17 (1) 


——— 


Totals 


E 


Family | _ 


Dependent 


> 
E 
5 
о 
ч 


Total 


~ 
© 
2 
[пру 
шәрә 
= 
© 
[m 
[ару 
| upi? 
"esc X 


пару 
шорчо 


миды Жыла ا‎ 


wenstee] 


| cute Ee ms us suc] 


Totais 
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(ii) Table:—Showing the distribution of female members 
according to age and carning— 


Table No. 17 (ii) 


Earner Dependent Totals 
Family 
> E "PES || > | E | Total 
=; | Tota 2. | Tota E. ota 
a | =: Е | а: gg 
cS a|s mg 
i | 
2 
3 
4 
| | | | 


Chapter II 


Graphs And Diagrams 


After condensing and summarizing the complex and 
numerous data in a systematic manner by means of classification 
and tabulation, we require certain divices which may present 
the condensed form of the data. This condensed form of 
the data is such thatit becomes at once comparable and 
leaves an everlasting impression on the brain of the observer. 
One of the methods making the data intelligible is to 
represent it by means of graphs and diagrams. The graphic 
& diagrammatic representation of the data is always appealing 
to the eye as well as to the mind ofthe observer. 
Advantages of Graphs & Diagrams:— 

(1) The graphs & diagrams represent the data in 
attractiveand appealing way both to the eye and mind. 

(2) The graphic & diagrammatic representation of the 
data leaves an everlasting effect on the brain. 

. (3) The diagrams are not only attractive and impressive 
but they save time also, Because through the diagrams, it is. 
possible to have an immediate grasp of significance. 

(4) Another merit of the diagrams is the ease with 
which the two sets of the data are compared with each other. 

(5) Forecasting becomes easier with the help of the 
graphs. 

(6) The graphs are help 
they give an indication of corre 

(7) The partitioning values (med 


ful in the interpolation also and? 
lation between the two variables. 
ian & quartiles еіс.) 
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and mode are also determined by graphs. 
Limitaiions of graphs & diagrams:— 

(1) The graphsand diagrams provide an approximate 
picture of the data and not the accurate. one, Thus they are 
useful in representation to general public but not to a 
Statistician ог ап investigator who is interested in the 
detailed study. 

(2) They are used to compare only such data which are 
technically comparable. 

(3) "They are easily capable of misuse. 

Graphical Representation of Frequency Distributions: 

In the graphical representation of the frequency distribu- 
tions the horizontal axis of the graph is used to show the 
variate values and the vertical axis for the frequencies of the 
variate, The graphs of the frequency distributions are of the 
following types— 

(1) Histogram (2) Frequency-polygon and Frequency 
Curve \3) Cumulative frequency curve or ogive. 

(1) Histogram : It is composed of a set of rectangles 
One over each class-interval on the X-axis. The width of the 
rectangle is taken proportional to the class-interval and its 
height is taken proportional to the frequencies in the case of 
equal intervals. But in the case of unequal class-intervals the 
height of a rectangle is taken proportional to the ratio of 
frequencies to the class-interval and thus the area of the rectangle 
is proportional to the frequencies of the variate or the classes, 
Example (1) :— 

The following is the frequency distribution of the yield 
of sugar-cane in tons per acre. 

Class : 35-40 40-45 45-50 50-55 55-60 60-65 65-70 
Frequency : 7 8 12 26 32 15 9 

Drawn a histogram representing the above distribution ? 
Solution:— 

In this example, 


the class intervals are equal and so the 
heights of the rectangles 


willbe’ proportional to the frequencies, 


| 
| 
| 


| 


j 
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Graph No. 1 
HISTOGRAM 


“representing the yield of sugarcane in tons/acre” 


v. —1 unit of frequency 


V.S. — — —1 Small 


Q5) 4 ду бо 46 б 6 e 

H.S.— — 1 Small фу. —1 unit of Variate Value 
Example No. (2): — 

Draw the histogram of the following distribution: — 
Heights: 48-50 50-54 54-50 56-58 58-59 59-60 60-63 
(in inches) 

No. of Boys : 
(frequency ) 1112 60 20 


Solution :— 
In this example; the classintervals are not equal, so the 


heights of the rectangles will be proportional to the ratio of 
the frequencies to their respective class-intervals. Thus the 


16 5 2 3 
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heights of the rectangles are given in the following table 
H { 
against the respective classes:— , 


Table No. 18 | 


— 


Classe: | Frequency/Class Interval 


48—50 12/2— 6 
50 – 54 60/4—15 
54—56 20/2—10 
56—58 16/2— 8 
58— 59 5/1= 5 
59—60 2/1= 2 
60--63 8/81 

Graphs No. 2 

HISTOGRAM 


"representing the heights of the boys” 


Es 
T 

= d 

ч H 
as : 
1 H 
> (49,0) 5° 585469 — &X 


H.S.—— 5 Small divs. —2” height 


| 


Frequency Polygon & Frequency Cur 


frequency polygon 15 construc 
means of straight lines w 
classes and the ordinates are 


| frequency polygon can а so be obtaine by joining the mi 
points of the upper sides of a histogram. by straight lines 
free hand curve drawn 


| 
| 
| 


А frequency curve is а 5020010; 

through all the points which are obtained for the frequency. 
olygon The area under the curve should be equal to that © 
the histogram- Frequency polygon is used to find the ‘mode’ 


Example (2а) 
Draw a frequency ploygon & a frequency curve for the 


data in example (1) on pase 8. 
Solution: 
In this example, the points of the classes are plotted on 
the X-axis and the frequencies on the Y-axis. 
h No. 3 (a) 


Representing the yield of sugar cane in tons/acre- 


.—1 unit of frequency 


АА 
а=) 
E 
g 
un 
= 
1 
| 
л 
> 
jan | 
des Has р! 55 
HS.—— 1 Small div.—1 unit of variate value 
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Example 3 (b):— 
The following table gives the distribution of height in 
inches for 100 students. 


Interval Frequency 
>57 upto 60 3 
260 , 63 12 
263. 152568 31 
266 69 37 
200 , 72 16 
>72 LET 1 
Total 100 


Represent the data in the form ofa histogram as well 
as a frequency polygon: 
Solution:— 
See the graph No. 3 (b) 
Graph No. 3 (b) 
HISTOGRAM & F. POLYGON. 


Representing the distribut on of height for 100 students 


V.S.— —.1 Small div.— 2 Students 


75 


H.S. — ——. 5 Small divs. — 37 of height 
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(3) Cumulative frequency curve or ogive : The 
ogive is constructed from а less-than type cumulative frequency 
table. The upper limits of the classes are taken as abscissae 
and the corresponding cumulative frequencies as ordinates and 
thus all the points are plotted on the graph. The free hand 
smooth curve through all these points is called the og?ve. 
Ogive is helpful in determining the partitioning values 
(median, quartiles etc.). 

Example (4):— | 

Draw а cumulative frequency curve for the data given 
in example (1) on page 28. 

Solution:— 

First of all, we shall construct the cumulative frequency 

table from the given data as follow:— 


Table No. 19 
Variate Value | Cumulative 
Under frequency (c.f) 

Under 40 1 

Under 45 15 

Under 50 2, 

Under 55 53 

Under 60 85 

Undek 6B a bg: 100 

Under 70 .| 109 " 


ل ل 
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The cumulative frequency curve is shown below:— 


Graphs No. 4 


Representing the yield of sugar-cane ín tnos/acre 


cmm 
41 


V.S.— — 51 Small div.—4 units of frequency 
E EE E ER ЕЕЕ i 


H.S.— — >1 Small div.—1 unit of variate value 


_ Gap between the upper & the lower limits of the 
adjacent classes:— — . 


In the case of the grouped data of the type a-b, b.c 
c-d, and so on, we have seen that the graphs of the BOE 


> aboy 
three types can easily be used, But in case, Х 


where there is 4 
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gap between the upper and the lower limits of the adjacent 
classes, the class limits should’ be so modified that the gap 
vanishes and the upper & lower limits of the adjacent classes 
become the same. 


Example (5):— 
Draw the histogram, frequency Polygon and the ogive 


for the following data:— 


Scores : 20-29 30-39 40-49 50-59 60-69 70-79 
No. of students : 2 14 22 20 14 3 
Solution:— 


In this example, there is a gap between the upper & the 
lower limits of the adjacent classes. Hence, first we modify the 
classes to have the continuous from of the data and thus finish 
the gap. For the construction of histogram, frequency polygon 
& the cumulative frequency curve for this data, we form the 


following table:— 


Table No. 20 

Frequency] Comulative 
@ ЕЕЕ) 
195—295) 2 2 
295—395) 14 16 
39:5 49:5) 22 38 | 
49:5--59:5| 20 58 
59:5—69:5 14 72 
695—795 3 75 
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Graph No. 5 (a) 
HISTOGRAM & F. POLYGON 


“representing the scores for Students" 


V.S.— — —1 Small div.— [Student 


(5,0) 195 145 35 U5 595 645 9% 


H.S.——->1 Small div.—2 units of scores 


T 
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Graph No: 5 (b) 
Cumulative-Frequency-Curve 


3o 


V.S.— — 51 Small div. —3 Students 


ДЦ 
5 LJ 
пшнш 
WEBEHUEHEP FA 


(t9 5,0) 445 395 495 545 8698 735 


H.S.—1 Small div.—2 units of scores 


37 
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Graph For Discrete Variable:— When the data is 
given for a discontinuous variable accompanied by its 
corresponding frequencies, then the graphical representation of 
such a data is made by drawing thick lines parallel to the 
Y-axis. These lines are drawn at the pcints of the discrete 
values of the variable shown on X-axis and their heights are 
proportional to their corresponding frequencies. 

Example No. (6) 

Represent the following frequency- distribution 
graphically: — 

DOES UT 2 3 4 5 6 

920 15 17 19 13 16 
Solution:—- 


In the graph, the variate values are shown on the X axis 
and their corresponding frequencies on the Y-axis. 


Graph No. 6 


“Graph for discrete variable” 


V.S.—1 Small div.— 1 unit of frequency 


H.S.—— — 5 Small divs.— 1 unit of variate value 
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Graph for Time Series : ( Historigram ) The 
Historigram is the graph obtained from the time series data. 
In these graphs, the time variable 42 js measured along X-axis 
and its corresponding values ‘u along Y-axis. All the 
available points form the data are plotted on the graph and 
joined by means of straight lines. 

Following few examples will illustrate the type of 
graph:— 

Example No. (7): — 

The following table provides the yearly figures of 

production of sugar in lakh tons— 


Year production of sugar 
(lakh tons) 
t 5: 
1948 9:65 
1949 8:90 
1950 6775 
1951 10:50 
1052 11:25 
1953 10°70 
1954 12:35 
1955 12:55 
1956 12:95 
1957 15:00 
1958 13:50 


(i) Draw the graph to show the fluctuations of the 
production of sugar. 

(ii) Comment on the fluctuations of the production. 
Solution: — 

(i) The production of the sugar in lakh tons is taken 
along Y-axis and years on the X-axis in the following graph. 
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Example No (8):— | 
Represent the figures given below, оп а graph-paper and 
comment on trend shown by the data. 


Year с - Price for Arhar 
in Rs./maund. 

1929 4:0 

1930 46 

1131 ‘ 36 

1932 3:6 

1933 38. 

1934 33 

1935 : 47 

1936 ЗАЎ 4 

1937 $ 4:3 у 

1988 : 4:3 

1989 я d 4:2 

100073 3 4 39 


Solution : : | : 

An eye inspection of the graph shows that the prices 
are neither decreasing nor increasing but they are fluctuating 
around the price Rs. 3.9/maund, the mean value. 
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Example No. (9):— ў 
Draw the graph from the following table, in which. the 
growth of the three kinds of wheat may be read from the 
graph. Discuss the growth of the three kinds of wheat ? 
(U. P. Board 1956) 


EET SS а аи 


N Age у 
~ Days Height in centimeters 
N 
Kinds N 
x | 15 | 30 | 45 | 60 | 75 90 105 | 120 


N 


Lacum N 


Pb, |30 |35 |70 19:0 26:0 | 510 | 980 1065 
Cis 2:5 |4:0 | 10:0|31-0| 44:0 820 | 1115, 1195 


Local `|3° | 45 |12:5| 22:0) 37.5 775 | 1090| 1125 


А 


Solution:— 


Of all the three varieties of wheat, the growth in Cys 
is minimum upto the age of 45 days in comparison to the 
local variety and after this, the growth of Cys is the highest, 
while that of РЬ is minimum. The growth of the local 
variety is between the two. 


Graph. . (9) 
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Representing the g wth of three kinds of wheat 


(ЕУ Жайчы ри чы та ш: п шаптапа РАВНА ЕЦ 


اا 
H ашивиктиин g‏ 
Be st ELULI‏ 


V.S.——-—>1 Small div.—3 cms. 


А Ru 
45 75 qo 105 ‘20 
H.S.——-+1 Small div.—3 days. 


An inspection of the gtaph clearly 
[l'age of 45 days, the growth in'local variet 
after it that of Cg is the" highest. Т 
character of growth, Cj, is the best. 


shows that’ upto the: | Y : 


yis the highest and, ~. а 


hus, as regards ‘the’ ' 
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Example No. (10):— 

The-average yields in maunds: per acre of rice and wheat 
crops ina state of India since 1950-51 are given below. 
Comment whether the yields аге increasing or decreasing 2 


Year Rice Wheat 
1950—51 527 8:88 
1951—52 438 8:25 
1952-53 125731 | 9:21 
1958—54 6:46 а ONG 
1954—55 6:12 9:64 
1955—56 718 -- 826 
1956—57 621 . 8:46 
1957 - 58 619 -. ` 1:89 
1958—59 713 8:61 
1959 —60 6:15 9:21 
1960—61 . “ТАБ 10:91 " 
1 ; (M.Sc. Ag. 1962) . 
Solution:— ; 


Let us first plot the yields of псе and wheat since 
1950—51-to-1960— 61 on the graph-paper. AS a result of 
this, we obtain the following graphs. : 


^ 


An eye+inspection of thé above graphs; tell us that— 
(i) the yield of wheat has an ‘increasing tendency. "The yield 
from 195051 to 1954—55 is increasing except a slight 
downward fluctuation forthe year 1951— 52. Further the 
yield is decreasing upto 1957—58 and again increasing for 
the remaining years. 

(ii) The yield of rice is increasing from 1950—51 to 
19 55—56 excepta slight downward fluctuation for the year 
1954—55. For the next two years, the yields are decreasing 
and are the same. For the year 1958—59, the yicld increases 
and again decreases for the year 1959—60 and increases for 
1960—61. Thus on the whole, we can say that the yield 
shows an increasing trend. 
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Example No. (11):— 

The following table provides the yearly figures of 
production in lakh tons in U. P. Comment on thé wheat 
production in U. P. 


Year Yield Year Yield 
1943—44 19.01 1952—53 28.24 
1944—45 23:56 1953—54 31:06 
1945—46 22:83 1954—55 32:84. 
1946—47 23:37 1955—56 30.41 
1947—48 21°94 1956—57 31:15 
1948—49 19°74. 1957—58 27:06 
1949 —50 24-26 1958—59 ' 80:36 
1950—51 26:78 1959—60 32:42 
1951—52 25°33 1960—61 38:82 


(M.sc. Ag. 1963) 
Solution: 

The yields of wheat in U. P. since 1943—44 to 1960 — 
61 are plotted on the graph-paper as shown below. 

The graph shows that the yields are increasing upto 
1945—46 and further decreasing for the next two years. 
Again from 1948—49 to 1954—55 the yields are increasing 
excepta downward fluctuation for the year 1951—52. After 
this, the yield neither shows an increasing nor a decreasing 
tendency, it merely fluctuates around the yield 30 lakh tons. 
But for the year 1960— 61, the yield increases, Thus on the 
whole, there is an increasing trend in the production of wheat, 
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Diagrams 

The following are the important diagrams — 

(1) One dimensional diagrams, 

(2) Two dimensional diagrams, 

(3) Three dimensional diagrams, 

(4) Cartograms and 5 

(5) Pictograms. NS 

(1) One Dimensional Diagrams:— The onc dimensional 
diagrams are the lines or bars arranged in ascending or descend- 
ing order of magnitudes (length) along a vertical or a 
horizontal scale, The lengths of the bars are proportional to the 
magnitudes of the items. 

(2) Two Dimensional Diagrams:—!n the two dimen- 
sional diagrams, the magnitudes of the items are represented by 
the areas as by squares, rectangles and circles etc 

(3) Three Dimensional Diagrams:—1n the three 
dimensional diagrams, the data are represented by the cubes or 
cylinders and the magnitudes ofthe items are represented by 
the volumes of the cuboids or cylinders. 

(4) Cartograms:— Here the data is presented by means 
of maps. 

(5) Pictograms:— Here the data is presented by means of 
pictures. 

But the bars and circular diagrams are most commonly 
used because of their accuracy and casiness in sketching. 

“One Dimensional Diagrams" 

'The bar diagrams of common use in the one dimensional 
diagrams are of the following main ty pes: 

(1) Simple bar diagrams, | 

(2) Multiple bar diagrams, 

(3) Sub-divided bar diagrams. 

(1) Simple bar diagrams:—Here the magnitudes of 
the items are represented by thick bars of uniform width with 
equal spacing between any two consecutive bars. The lengths 

of the bars are proportional to the magnitudes of the items. 
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One bar is drawn for each item and they are arranged in the 
ascending or descending order of lengths along a vertical or 
horizontal line. These simple bar diagrams are appropriate for 
the data contained in simple tables. 
Example (12) 

Draw a simple diagram to represent the following 
statistics relating to the area under different crops:— 


Million Acres 
| 
Rice 80:3 
Wheat 27:6 
Jowar 214 
Other food crops 88:2 
Oil seeds 17:6 
Cotton 14:5 
Other fibres 31 
Fodder crops 10:2 


Other non food crops 3:9 
J 
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Solution:— 


The Acreage ( million acres ) under different crops is 
shown by the simple bar diagrams: — 


"dodo ii Чон yan О |- 
HAW HSH E ЖЕ] RE Ет 
ш ЕЗИ] eee 


Diagram No. 1 
*Simple Bar Diagrams for Acreage under different Corps" 
H.S.— — 3 Small divs. width of each bar and 


2 Small divs. = Spacing between two consecutive bars. 


soxoy WONN £= AIP TEMS [= SA 
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(2) Multiple Bar Diagrams:— These bar diagrams 
are constructed like simple bar diagrams. They represent more 
than one type of data at a time and so. two or more bars ( as 
the case may be) are constructed at a time side by side. 


Example 13 (a) 


The following table gives the number of motor-cars 
produced in the three countries during the period 1929 —1935. 


Year Germany France U.K. 
1929 96 254 241 
1930 74 23) 241 
1931 68 201 226 
1932 50 172 248 
1933 99 189 296 
1934 172 187 355 
1935 245 166 417 
Solution:— 


Taking three bars at a time, side by side, we completed 
the diagram. Here the time ( year ) is taken along the X-axis 
and the number of cars produced in these countries along 
the Y-axis. 


= 10 motor-cars 


V.S. ——— 1 Small div 
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Diagram No. 2 (a) 
“Multiple Bar Diagram for No. of Cars 
Produced in U K.,France & Germany” 


1929 1930 1931 1932 1933 1934 1935 
HS. ———6 Small divs.—1 year 
2 Small divs.— width of each bar 
1 Small div.— spacing between each 
set of three bars 
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Example No. 13 (b):— 


The following table provides the percentage of cultivators 
and percentage of cultivated area in different sizes of holdings 
in U.P. Depit the data in a bar diagram to the scale 7 

(M.Sc. Ag. 1964) 


Size of holding | | Percentage Percentage 
(acres) of caltivators of area 


toto ч сл оҥ 90 -‏ سم سر 


ف ف ت ت فم ت فن ف خب 
©л сл сз -1 оо бо бо Ф‏ جب 93 


ت طن # ق ف ف SRE‏ 
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Solution:— 

The percentage of cultivators and the cultivated area in 
different sizes of holdings will be depicted by the compound 
bar-diagram. Corresponding to each holding size, two abjacent 
bars will be constructed where one of the two will represent 
the % of cultivators and the other % of cultivated area. 
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(3) Sub-divided Bar Diagrams:— T hese diagrams are 
suitable for the data given in the complex tables; where the 
magnitudes of the items are sub-divided-into sub-classes. 


Example (14) 


The following table gives the populations of. males out 
of the total populations for different years for a district. Draw 
a suitable diagram to show the populations given in the data. 


Year Total Population Population of Males 
1910 85761 45761 
1911 92821 47212 
1912 83728 41312 
1913 72511 37256 
1914 83123 42218 
1915 65735 34725 
1916 46849 23428 
Solution:— 


The bars are constructed for different years and their 
lengths are proportional to the magnitudes of the total 
populations. Then cach bar is sub-divided to show the 
populations of males for the different years. 
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| Diagram Мо. 3 
“Sub-divided bar diagram for total population 
& population of males" ; 

from (1910-16) 


~~~ 


ы 


V.S.——-+1 Small div.—3000 population 


1910 LELO 1912 


E Cee 
\9\з \91ц 1915 1916 


H.S.——-,3 Small divs.—1 year 


f =width of each bar 
2 Small divs. —= Spacing between two consecutive 
bars 
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Example (15):— 

The following table shows the expenditure incurred by 
the central government and the governments of States of type 


A & B in the Ist. five years-plan under the six major heads. 
Represent the data by means of a suitable diagram ғ 


Expenditure in the Ist five years-plan (in crores of rupecs) 


Subject Central Type A States Type B States 

Agr.& development 186:3 127:8 37-6 
Irrigation & power 265:9 206:1 81:5 
Transport & 
Communication 409:5 56:5 17:4 
Industries 146:7 17:9 Т1 
Social Services 191:4 192:8 28:9 
Miscellaneous 40:7 10:0 то 
"Totals 1240:5 610:1 173-2 
Solution:— 


The date can be represented by means of a sub-divided 
bar diagram, Three bars will be constructed and their heights 
would be in proportion of the total expenditure of the three 
types of governments respectively. Further each bar will be 
sub-divided into six parts and the height of each part will be 
proportional to the expenditure incurred on it. 


V. S.— ——›1 Small div.—20 crores of Rs. 


Graphs & Diagrams 
Diagram No. 4 


Agr. & Develop. 
Irrigation & power 


Industries 


Т 


Social Services 
Miscellaneous 


C. GOV. G. of T. A. G. of T. B. 


Transport & Comm. 
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The diagram provides the information about the total 
expenditure of the central, type A andtype B governments. 
This also gives the distribution of the total expenditure of 
three types of the governments with respect to the items of 
expenditure (Transportation, Agriculture etc). 


Example No. (16) 


Represent the data given below by a suitable diagram. 
The table gives the birth rates and the death rates of the six 
countries of the world during the year 1987. 


Name of the Country Birth rate Death rate 
Egypt 48:5 27:2 
Canada 19:8 10:2 
United States 170 1r2 
India 34-5 22:4 
Japan 30-6 17:0 
Germany 18:8 11:7 

Solution:— 


These birth and death rates сап be represented by sub- 
divided bar diagrams. Six bars are constructed with their 
heights proportional to the birth rates. Further the bars are 
Sub-divided into 2 parts, the lower one shows the death rates 
and the remaining upper portion shows the difference between 
the birth and the death rates. 
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Diagram No. 5 


Showing the birth & death rates 
of 
six countries in 1937 


Difference 
Death rates 


—2 people 


1V.— 


V.S.———18mall d 


14483 
tPEUED 
&чешләсу 


H.S. ———›>2 Small divs.— width of each bar 
—Spacing between two 


consecutive bars 


62 Statistical Mathcds in Education 
Percentage ub-divided bar-diagram:— 
In this diagram, the total values are taken equal to 100 


and the component parts are expressed in percentages. As the 
length of each bar in this diagram is the same, so it cannot 


give the comparison between the absolute magnitudes of the 


components. But the relative changes in the component parts 


are satisfactorily compared. 


Example (17):— 


With the help of a suitable diagram, show the absolute 
as well as the relative changes in the students population of 
the college A and B in the different faculties in 1947. 


Faculty College A 
Arts 350 
Science 500 
Commerce 650 
Law : 00 
Totals 1800 
Solution:— 


College B 
290 
250 
150 
150 


720 


(а) For the comparison of the absolute changes in the 


students population, the sub-divided bar diagram will be a 


suitable diagram. 
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Diagram No. 6 (a) 
howing the absolute changes in Students population in 1947 


Arts 


Science 
Commerce 
Law 


AA 
AA 
^^ 
^^ 
AN 
An 
AN 
AN 


V.S.——1 Small div.—30 Students 


College A College B 
H.S.— — 5 Small divs.—width or each bar 
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| 
| 


(b) The relative changes in the component parts can be 
shown by the percentage sub-divided bar diagram. For this 
purpose, first we change the absolute magnitudes into the 
percentages. | 


College 
(B) 
absolute Relative absolute | Relative 
magni- magnitude | magni- | magn itude 
tude tude in % 
Arts 350 200 $29 x 100= 
27:77 
Science 500 = 250 259 x 100= 
34:72 
Commerce 650 150 1$9x100— 
20°85 
Law 300 120 129 х 100= 
16:66 
"Totals 1800 720 100 
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Diagram No. 6 (b) 
Showing the Relative Change in 
Students population in 1947 


Arts 


:t Science 
Commerce 


Law 


у.5:—— +1 Small div.— 5% 


у ә8әүүогу 
g 2321109 


Н. S.— — —. 3 Small divs. — width of each bar 


4 "у " spacing between two bars 

Example (18) 

i For the following data, show by a suitable diagram the 
| comparison between the relative and the absolute changes. 
} 

Principal heads Year 1938 — 39 Year 1939—40 
| of Revenue (Lakh of Rs.) (Lakh of Rs.) 
| Custom 4050 e 4588 

Central Excise 868 652 

Corporation tax 204. 238 

Incomctax 1374 . 1420 


Other heads 974 1262 


45 
Gie 
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Solution - 


(a) For the’ comparison of the absolute changes, the. 
subdivided bar diagram will'be a suitable diagram. 


Digram No: 7(a) 
Showing absolute changes in amounts‘incurred on various. 
heads іп «1938-39 & 1939-40. 


RK Custom 
ААА C. Excise 
AAA 
AAA 
AAA Corp. tax 
h ААА Income tax 
n 
AAA Other he 
а Ху ads 
AA AL 
ne AAA 
a АЛА 
nd КАМ 
AA 
= ЛАЛ 
о AA N 
a AAA 
|| 
E 
dz 
5 
= 
Е 
К 
~ 
on 
ш 
> 


MÓN 
Н. S.— — 3 Small divs. —width of each bar —1 year 
4 4: " ==врасте between''two bars 
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(b) For the relative, changes in the component parts, the 
percentage -sub-divided bar diagram will be. constructed. ' For 
this purpose, we ,change the absolute tnagpi into the 


percentages.as, shown idis 


Year 1938— 39 


Year 1939—40 


ii «Heads of 
Revenue 
Absolute | Relative | Absolute |. Relative 
magni- magnitude magni- | magnitude 
tude in 9$ саде || 202590 
Custom ~ | 4050 543 4588 564 |. 
Centra! Excise | 868 116 652, | 8:0 
| Corporation | 204 27 938 2:9 
1 tax 
| Income tax.. 1374 18:4 1420 17:5 
Other heads . 974 13:0 1262 15:2: 
"Totals 7470 100 8160 100! 
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Diagram No 7 (b) 
Showing Relative changes in amounts 
iucurred on various heads in 


1938—39 & 1939—40 


se Custom 
E: C. Excise 
E Corp. tax 
= Income tax 
E Other heads 
n 

+ 

| 

| 

Л 

> 

H. S. — — —3 Small divs. —width of each bar—1 year 


Ae "  z Spacing between two bars 

Bilateral Bars: In this case, the bars are drawn above or 
below the base line (in the case of vertical bars) and to the 
left and right perpendicular to the base line (in the case of horizon- 
tal bars). The bars above the base line or to the right of the base 
line are used for the+ve quantity and the bars below the base 
line or to the left ofthe base line are used for —ve quantity 
in the data at hand, 
Example (19):— 


The following table gives the number of houses in ten 
small towns of India during the census of 1941 and 1951. 
Town : 1:25:58 04756 ^6... 7. БОТО 
No. of houses 
in 1941: 200 300 400 480 520 300 400 280 750 570 
No. of houses 
іп 1951: 250 340 420 495 530 290 380 250 710 520 
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Represent the increase or decrease in the number of 
houses in 1951 in comparison to the census of 1941. 


Solution: — 


In some cases, there is an increase. while in other there is 
a decrease, The arrangement is made such that the towns which 
show an increase are written first and the towns which show 
decrease are written later on. Within these graphs, the towns 
have been arranged according to the magnitudes cf the changes 
(increase or decrease). 


No. of houses 
in 


Town 
Difference 
194. | 1951 
1 200 | 250 +50 
2 . 300 340 +40 
3 400 420 +20 
4 480 495 +15 
5 520 |. 530 +10 
6 200 290 —10 
y 400 | 380 —20 
-8 280 250 — 30 
9 750 710 —40 
10 57 520 —50 


In the diagram, the +ve changes (increase in the no. of 
houses) are shown by the bars drawn to the right of the base 
line and those showing the —ve changes (decrease in the no. 
of houses) are drawn to the left of the base line. 
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Example No (20 == 


a 


Represent the following data by a suitable diagram 
showing the differences between the proceeds 8: costs. 
У ih ! hz 


Year Total Proceeds Total costs 
(in thousands of Rs.) (іп thousands of Rs.) 
1955 i ;:22*0 19:5 
1956' 273 217 
1957. 28:2 30°0 
1958 30:3 25:6 
1959 Зо 26.1 
1960 33:3 34-2 
Solution:— 


First of all we take the differences of the total proceeds 
,& the total costs and then -ve differences ,are written first 
‚апа —ve differences after them. Within these groups, they 
‘have been arranged according to the magnitudes of the changes 
( (increase ог decrease). — : t 


» Year : 1959 1956 1958 1955 1957 1960 
Difference: +66 +56 +47 +25 —90 —1:8 
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Diagram No. 9 
Differences between Proceeds & Costs 
from 1955--60 


H. S. — — >1 Small div.—200 Rs. 


(2) Two Dimensional Diagrams : (Area Diagrams) 

In these diagrams, the areas are proportional to the 
magnitudes ofthe data, The common diagram of this type 
are rectangles, pie and square diagrams. The square and pie 
diagrams serve the same purpose but the pie diagrams are the 
easiest to draw and they can be made accurately. Hence the 
pie diagrams are commonly used in place of square diagrams 

(a) Pie/circle diagrams: When the differences 
between any two quantities to be compared are large, bar 


diagrams can not be vsed, as one of them will be extremely 
small. 


| 
| 
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and the other large. In such cases, the squares or Pie diagrams 
are used, For drawing the circles, the squar roots of the quan- 
tities are taken as the radii of the circles and so the areas of the 
circles are proportional to the magnitudes of the data. 

(b) Sub-divided Pie diagrams : When we want to 
compare the totals as well as their components with one 
another, we may use sub-divided Ріс diagra.is. The total value 
is equaled to 27 —360? and then component parts are expre- 
ssed in terms of angles. Having determined the angles corres- 
ponding to different components, the circle is divided into 
different sectors on the various angles pre-determind. 
Example No. (21):— 

Represent the following data by a suitable diagram— 
Countries Production of cane sugar in 1938-39 

in Quintals 1000’s omitted 


1. India 2750 
2. Java 1550 
3. Hawai 835 
4. Columbia 51 
Solution:— 


; Here the difference between the quantities 51 & 2750 
is very large. Thus Pie diagram will be а suitable diagram to 
represent the data. Now we construct the following table—— 


Quintals (1) | Square 


[218 
Radiigin 


Countries 0000" S roots 
omitted (2) inches 
l. India 2750 52:44 
2. Java 1550 39:37 
3. Hawai 835 98:90 


4. Columbia| 51 74 
The column (2) gives the square roots of the figures 
written in column (1) and the column (3) contains the 
the numbers which are obtained by dividing the numbers of 
the column (2) by 50 which are the radii in inches. The circles 
with these radii are arranged in the descending oder. For 
finding the scale, we calculate the area of the Ist circle= 
‚ & x (105)2—3:14 x 1'1025 sq. inche . 
Thus 3:14 x 1-025 sq. inches=2750,0000 quintals 
== 796,5000 quintals. 


`. 1 sq. inch 
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Example 22:— 


The allocations for Madras & U.P. States under the 
second 5 years plan are as given below— 


Heads Madras State U.P. State 
(lakhs of Rs.) (lakhs of Rs.) 

l. Agr. & Cumunity 

Development 3535:5 6763-9 
2. Irrigation & Power 7125۰0 8042:5 
3. Industry & mining 1520:0 1643.4 
4. Transport . 875 1723:2 
5. Social Services 4065۰9 6863:8 
6. Miscellaneous 2592-1 272:8 


Draw a sub-divided Pie diagram to compare the cost of 
development under each head in the two states? 


Solution:— 
The solution for the problem is given in the tabular form : 
as shown on the next page. 


ion 
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Example (23):— 

Represent the data of the following table by means o 
a suitable sub-divided Pie diagram— 

Clearing house Statistics in 1940-41 and 1947-48 in 


certain cities— 


Total amount (Rs.) 


City 
1940—41 | 1947—48 


-—— 


Bombay| 80,282 255,264 
Calcutta | 100,853 259,996 
Delhi 2,853 12,646 
Kanpur 1,920 10,983 
Karanchi 4,676 27,481 
1,633 4,954 
10,865 34,794 
4,228 51,896 


Totals | 2,07,260 | 658,014 


Solution: — 


The steps in the construction of Pie diagram are given 


below— 
(i) First of all, calculate the square-roots of the total 
amount of clearing house returns for the two years i.e. calcu- 
late 4207260 and 4658014 which аге 455 & 811 respectively. 
(ii) Divide these quantities (455, 811) by a suitable 
number to obtain the radii of the circles. 
(iii) Then calculate the angles corresponding to the 


amounts for cach city 1n the separate sessions. The calculations 
are done in the following tabular manner— 
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Cities 


Bombay 
Calcutta 
Delhi 
Kanpur 
Karanchi 
Lahore 
Madras 


Others 


Totals 
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Year 1940—41 
Angles calculated 
(degree) 


80232 x 360 у 
207260 139:4 
100853 x 360 
— 907260 
2853 x 360 _ 5-0 
~ 207260 
1920x360 _ 
33072605 7 
4676x360 __ 
` 207260 
1633 x 360 SS 
207260 — ^ 
10865 x 360 _ 
207260 
4228 x 360 7-3 
207260 


-175:2 


18:9 


360 


Year 1947—48 
Angles calculated 
(degree) 


255264 х 360 1 59.7 
658014 

259996 x 360 _ 145.5 
658014 

12646 х 360 _ 6.9 
658014 

10983 x 360 

— 658011 - 

27481 x 360 

"658014 - 

4954 х 360 9, 

658014 — 27 

34794 x 360 
658014 _ 

51896 x 360 
658014 


=6:0 


=150 


=19:0 
—28:4 
360 


нае раа бе T ЗЫ Б 


Finally we divide 455 & 811 by a number say 400 to get 
1.14 and 2:05 as the approximate radii in inches and the 
sectors on various angles are cut from these circles The area 
enclosed within each sector represents the magnitude of the 
amount returns in the specified years for different cities. 

Scale : 1 sq. inch —52389 Rs. 
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Diagram No. 12 


*representing clearing house statistics’ 
Year : 1940—41 Year : 1947—48 


Scale : 1 sq. inch— 52389 Rs. 


Others 
Bombay 
Calcutta 
Delhi 

æ Kanpur 
Karanchi 
Lahore 
Madras 
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Example 24— 

The following table gives the total outlay on rural deve- 
lopment proposed in the first Five Years Plan and its break 
down into major items. Give a suitable diagramatic represen- 


tation of the data. (M.Sc Ag.Agra, 1965) 
Amount 
Item (in crores of 


rupees 
ML Hpes € 


Agr. & community development | 360:43 


Irrigation EXIIT 167:97 
Irrigation & Power (multipur- | 265:90 
pose projects) LP 

Power wes бао 564, ө 2%» OE е ols өөө 127:54 


"Transport & communications’ | 497-10 


Industry ГТО 173:04 
Social services LLLI . 339:81 
Rehabilitation UD 85°00 
Miscellaneous 51:99 
CES RR P Y^ ODS CS EEA 
Total 2,068778 


———————————— A 


Solution:— 

The suitable diagram for representing the above data is 
sub-divided Pie diagram. The total area of the circle will repre- 
sent the total amount i.e. 2068°78 crores of rupees and its 
components will represent the various items of expenditure 
under the plan. 
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Amount (in 


crores of Angle 
Rupees) 
DER 


Item 


LAT YE REM UMS 


Agr. & com. development ....--- 300-43 627° 
Irrigation & poWer.--- nnn 167-97 29:2? 
Irrigation... ee eem 265:90 46:3? 
POWEL... eee t te 127:54 22:2? 
Transport & communication......- 497-10 86:5? 
Industry «n 13704 30:12 
Social services... 339:81 59:1? 
Rehabilitation .....- | 85:00 14:8? 
Miscellaneous.....-. nn] 51:99 9:0? 


LL 
3599 = 
36° 


2068:78 


Let the radius of the circle be 2", then 7 r= Area 
or 22 X4 sq. inches =:2068°78 crores of rupees: 


s. Scale : 1 sq. inch=1 64:56 crores of rupees. 
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Diagram No. 13 
Showing the total outlay on Rural development = 


o ^ ( 4 
Fed Y LZ 
яс rx 
RAR an ick ы. z 
it er 


МА а 
МА и ЈУ 
Y MM 


1 sq. incl.—164:56 crores of Rs. 


Scale : 


35 mum, 


Knsnpu[«- 
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Rectangles: The rectangles are the two dimensional 
diagrams and are used when the two magnitudes which are 
related to a third one are to be represented on the same diagram. 
For example, the average yicld of wheat multiplied by the 
total acreage under wheat represents the total production of 
wheat. To represent it diagramatically, one side of the rectangle 
is taken proportional to the average-yicld and the other is 
proportional to the total acreage; the area of the rectangle gives 
the totat production of wheat. Subedivided rectangles are used 
when one of the magnitudes is divided into several 


components. 


Example No (25) 


An analysis of the monthly wages paid to workers in 


the two firms A & B gives the following resulis— 
Firm A Firm B 
80 100 


No. of wage earners 
415 


Average monthly wage (Rs.) : 52.5 


Represent the above facts diagramatically ? 


Solution:— 


Here the average wage multiplied by the total number 


ives the daily paid roll. Hence the most suitable 
he above data isto construct the 


two rectangles, one for each firm. One side ofthe rectangles 
will be proportional to the average wage and the other will be 
proportional to the total no. of workers employed in a firm, 


of workers g 
diagram fot representing t 
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Diagram No. 14 
Showing the average monthly wage and number 
of wage earners 


V.S.— 1 Small div.— 5 units 


H.S.— — 1 Small div. —5 Rs. 


The family budget data, where we compare the 
absolute totals as well as the percentages of the various 
ttems to the total, is very suitably represented by the 
rectangles. In this case, the height of each rectangle is kept 
100 and the width is made proportional to the total 
magnitude. 

Example No. 26 

The following table gives the details of the monthly 
expenditure of three families. Represent them by a suitable 
diagram— 


Item of the Family A Family B Family C 
expenditure Rs. An. Rs. An. Rs. An. 
Food 12—0 25—0 28—0 
Clothing 2—8 8—0 14—0 
House rent 2—0 4—0 7— 
Education 1—0 5—0 7—0 


Miscellaneous 2—8 8—0 14—0 
Totals 20—0 50—0 70—0 
ee МИНИ 


Graphs & Diagrams 85 


Solution:— 


First we prepare the following table:— 


Family A Family B Family C 


Item of 
expenditure | Actual Actual Actual 
expen] 96 | expen| % | expen} 9% 
ses ses ses 
Food 12:00 | 60:00 | 25:00| 50:00; 28 | 40.00 


Clothing 2۰.0 | 12:50 | 8:00 | 16:00, 14 | -0.00 
House rest 2:00 | 10:00) 4:00 | 8:00 | 7 | 10.00 


Education 1:00 | 5:00 | 5:00 | 10:00 7 10.00 
Miscellaneous | 2:50 | 12:59 | 8:00 | 16:00! 14 20.00 


Total 20:00 | 100 50 | 100 70 100 


The heights of the three rectangles will be taken as 100 
for each and their width in the ratio 20:50:70. Each rectangle 
will be further divided into component parts according to the 
figures in the percentage column. 
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Diagram No: 15 
Showing the family-budget data 


RNAS 


4 — Miscellaneous 
— Education 
епа LE | | 


< ERARE HHHH — House rent 
TS к RESFAIRESESESESESESTS HEEE SS p 
NS SNR PARERE XR E — Clothing 


r VAS ws ER 
FARR и“ NAS ASUNA —-Food 


д 
AS 


T ATA 


Family A EAB Family C 
H.S.—— — 1 Smali div. — 5 Rs. 


V.S.—1 Small div =5% 


gram represents— 
of the item to the total by the height of the 


corresponding component part of thc rectangle. 


% 


) The actual expenses on each item by the area of the 
The 


corresponding component part of the rectangle. 


) 


(3) The total expenses for the family by the whole area 


The above dia 
of the rectangle. 


(2 


(i 
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Example (16):— 


Represent che following data by a suitable diagram — 


Avcrage | | Area under crop [Total 
Year yield Total yield Irrigated |Non-Irrigated |(acre 
Md/acre 3 age) 
o 
I = 
45 
1940- 9.5 427.5 22 23 
4l | 
1941- 8.4 361.2 23 20 43 
2 


Solution:— 


Here we note the rclition— 


Average yield x toral area—total yield, 

So the rectangles аге ће suitable diagrams to represent 
the given data. Here we construct the two rectangles on the 
same horizontal axis taking their width in the ratio 95:84 
and heights in the ratio 45:43. Each rectangle will be further 
divided into two components according to the irrigated and 
non-irrigated areas. 
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Diagram No. 16 


Showing the average yield md/acre & area under crop 
irrigated and non-irrigated 
- = 


4 Non-Irrigated 
Irrigated 


2 acres area 


pb | 
p |>] 


53020202! 
PI 


asaz; 


IIIS 


BEN 


m EG cp 
INAIAIAIAIALA IA АЧА ЧАА Ц 
O'R CONS SNS TCL ALALAIA AIA 


1940—41 1941—42 
H.S.——-+1 Small div.—1 Md./acre 


A 
3 
[Pi 
21727272 


E 
[zil 
>] 

ral 
"ud 


V.S.-+1 Small div. 


а 


Three dimensional Diagrams : When the ratio 
between the two quantities to be compared is so large that the 
two diamentional diagrams are not suitable for representing 
them, we may represent them by three dimensional 
diagrams. The volumes of the diagrams represent the magni- 
tudes of the quantities to be compared. In this type of diagrams 
the spheres, cubes, and the prisms are common but cubes are 
generally drawn in practice. 
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Measures of Central tendency з 
(averages) and dispersion 


M 


The histogram or frequency curve gives the general idea 
of ths distribution of th: variate aid henc: the frequency 
graph can be used to study and compare the given distribu- 
tions. But the study through th: graphs depend: upon the 
accuracy and skill of the eye, which is an uncertiin factor. 
Thus the study and comparisons by it may not b> very 
reliable. Therefore it is necessary to know certain features of 
the distribution, which give an idea of the distribution and can 
be determined arithmatically. Two such features of the distribu- 
tion are its average and dispersion (variation). 

Average : An average is the value of the variate which 
claims to represent the distribution. Some of the variate values 
will be above this value and others below it and so it is known 
as a measure of the central tendency. 

There are three averages (i) Mean (ii) Mode and 
(iii) Median. 

Exp. No. 1 

What are the properties of an ideal average. In what 
circumstances, would you consider the mean, mode and the 
median, the most suitable statistics to describe the central 
tendency of the distribution ? 


Sol:— 


An ideal average should have the following properties:— 
(1) It should be rigidly defined and its value should be. 
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definite which can be determined mathematically 
(2) It should be based on all observations 
(3) Its calculation should not be lengthly and tedious. 
(4) It should be least affected by sampling fluctuations, 
(5) It must be capable of algebraic treatments i.c. if the 
averages of the component series are known then the average 
of the whole series should be expressible in terms of the averages 
of the component series. 
Relative merits & Demerits of different averages : 
(1) Mean (Arithmatic average) 


Advantages:— 
(1) It is readily understood and well defined. 


(2) It is based on all observations. 

(3) It is easy to calculate. 

(4) In most of the cases, it is not affected by the samp- 
ling fluctuations. 

(5) It is capable of algeberaic treatments. 

(6) It gives weights to all items which are directly 
proportional to their sizes. 

Disadvantages:— 

(1) It some times gives values which may not be phy- 
sically possible c.g. the average number of eges laid by a hen 
as 18:5 per month. 

(2) It givcs undue weights to the extreme items. 

(3) It can not be calculated in the cases where the 
extreme ends are open. 

(2) Mode : 
Advantages:— 

(1) In most of the cases, it is easily found. 

(2) It can be found out from the graph merely by an 
eye inspection. 

(3) It can be found out for the distributions where the 
ends are open, 


(4) It is the type that to the ordinary mind, seems to 
be the best to represent the group. 
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Disadvantages:— 
(1) No weights are given to the extreme items, 
(2) A clearly defined mode does not always exist 
(3) It is not capable of algebraic-treatments. 


(3) Median : 


Advantages:— 
(1) It is well defined and can be calculated easily. 
(2) It does not give undue weights to the extreme items. 
(3) It is possible to calculate even in the cases where the 


intervals are open. 


Exp. No. 2 
What is an arithmatic average ? What are its properties ? 


How would you calculate the Arithmatic Average ? 
Sol:— 

The arithmatic mean of a series of individuals ıs obtained 
by adding up all the values ind then dividing the total by 
the number of individuals. 

For example, ifthe heights of 10 randomly chosen 
plants are-52", 55", 57", 61", 64" 65", 67", 68", 70", & 71”, 


then the arithmatic mean or simply the mean of this sample 


of 10 plants will be the sum (52+55-+......... +71=030) 
divided by 10. i.c. mean== 9. 3' 
Im General, Ix a es ,x, are the values of n 


Be. Sere 
measurements, the arithmatic mean X of the X, 1s defined 
by the relation— 


Where Xj) Xa , Xn are the given valucs. 
n= number of items 
X—the sum of 
X= Arithmatic mean of X, 
Now, the formula (i) will be read as “The arithmatic 


E ; D 
mean of x^, is the sum of x’; divided Ьу thei number. 
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Exp No.3 
The yield of rices in 12 equal plots of a village are 
given as follows (in maunds)— 


68, 80, 94, 104, 120, 114 
125, 130, 130, 140, 141, 145, find the mean yield + 


Solution:— 


If the yields are denoted by x;, then 


xx. 68--80-......4+145 1392 - 
T= T چ‎ =116 maunds/plot. 


х= 


If the value x, occurs f, times, xs occurs fs times, and 
so on, then 


ў +x + Т0 £x, Xfx _ 54 AT ON ( ii) 
f,Ff...... +, Sf in 
as Xf--n 
where X=Arithmatic average and x, Ха...... хо aie 
the given values of the variate ху. 
CEU f, are the respective frequencies 


To make the procedure clear, let us take the yield of 
milk by 80 cows of a dairy farm on a certain day (in 
kilograms) — 

Milk-yield : I1. 14 17 225 30 32 


No. of cows 6.09 12: TOGO TOE OF кита 
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To calculate the arithmatic means of the milk yield, we 
prepare the following table— 


piyi (1) No. of cows (2) (3) 
(in Kgm.) f fx 
11 6 66 
14 9 126 
17 12 204 
21 | 16 336 
25 21 525 
30D 9 270 
32 1 224 


Totals 80=sf 1751—xf 


In column (1), we put the variate values and in column 
(2) their respective frequencies against them. In column (3) 
we put the product of (1) & (2). For example, the first value 
66 of the col". (3) is the product of 11 & 6 which are the Ist. 
figures in the col". (1) & (2) respectively. Similarly, the 
other figures of col". (3) are attained. Thus, we get— 


Xfx sum of col”. (3) 1751 


i: —1751 51.8875 k 
Sf eam of col (2) SO ie 


== 


This formula can be applied іп а grouped frequency 
distribution after the classes have been replaced by their 


respective mid-values. The procedure of calculation will be 
clear from the following Exp.— 
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Exp. No. 4 


Given the following distribution, calculate the mean: 


Height of plants No. of | Height of plants | No. of 
(in cms.) plants (in cms.) plants 
375—425 
12:5—17:5 2 4 
42:5—47:5 
17'5—22:5 22 6 
| 475-525 
22:5-75 19 | 1 
| 52:5— 57:5 
27:5— 32:5 14 | 1 
32°5— 37-5 3 
Solution:— 


Й 


The mean is given by the formula х=: where the 


computations of xfx—n are given below in the tabular form—- 


(1) | (2) | (3) 
x f fx. 
15 2 30 
20 22 440 
25 19 475 
30 14 420 
35 3 105 
40 4 160 
45 6 270 
50 1 50 
55 1 55 


| Totals | 72—xf 2005—xfx | 
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In the соја, (1) we have put the mid-values of the 
classes and in (2) the class frequencies respectively. Thus, the 
product of (1) & (2) are put in the col". (3) 


Therefore, sfx is the sum of all figures in the соја, (3) 
and xf—n. the sum fo all figures in the col.(2).Hence we have 


=~ = 27°85 centimetres. 


fx 9008 
m7 ® 


For a short cut method for computing the arithmatic- 
mean, the formula is— 


X—A fd 
n 


where A is the assumed mean and 
d=(x—A) the deviation of the variate from the 
assumed mean. 
Properties of the Arithmatic Mean: 


(1) The sum of the deviations of the velues of x, from 
their mean X is always zero i.e. if хт Karsan ха be the variate 
values and X be their mean, then (x —-X),(x,— X ) С.х) 
etc. are their deviations form the mean and the sum of 
deviations is zero. i.e. X(x—X)-0 


(E) HE E KN Xy Бе the arithmatic means of k 
distributions with respective frequencies пу, ns,...... ny then 
the mean X of the whole distribution is given by— 


E nx Hu DkXk 20Х 
X njd...... Ny zn 


(3) If xi, хр...... x, be the variate values _ With their 
mean X and a variate y=ax +b is obtained, then Y =X b, 
where a & b are constants. 
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Exp. No. 5 
lf ха be the ‘n’ values of any measurement 
with their respective frequencies f,, fa eee f, and mean X, 
then show that 
= xfd 
WARTE, 


where A is the assumed mean and 


d is the deviation of x from A 


i.e. 4=(х—А) 
Sol — 
4 We define pus 
—Xf(x—A-FA) 
mi n 
аА) | АЗЕ 
n n 
Xf(x—A) PA CB 
n n 
S CI cS me Е (i) 
n 
or X— ACE 
n 


In future, for the computation of mean, we shall apply 
the above (i) formula which reduces the bulk of calculations 
and hence called the short cut formula for mean. 


Exp. No. 6 


Using the short cut method, compute the mean ofthe 
following data— 
(2) No. of branches/ 
plant (x) : 5 10 15 20 25 30 35 40 45 50 
No. of plants (f): 20 43 75 67 72 45 39 9 8 6 
(b) Age (years) 0—10, 10—20, 20—30, 30—50, 50—80 
No. of pesrons : 61, 49. 40, 60, 28 
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Solution:— 
(a) To calculate the mean of this data, first we perform 


the following table— 


The assumed mean і.е. A should be taken at such а 
value of x which divides the whole distribution approximately 
into two equal parts, ѕо that some of. the ‘d’ values be —ve 
and the others be +-ve- Here, we have taken А =25 and col". 
(3) is obtained by subtracting А =25 from the figures of 
соја, (1) as the first value in col”. (3) is 5—25=—20 and so 
on, the others. The col". (4) is obtained as the respective 
product of (2) & (3). Then we compute 


—1070 .8 —22- 
—25-- Gag 25—28—222 


X—22:2 no, of branches/plant. 
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(b) To calculate the mean of this data, the following 
table is performed— 


The col”. (1) of this table has been obtained dy replacing 
the classes by their respective mid-values and col”. (3) is the 
result of subtracting A=25 (assumed mean) from the figures 
of col". (1). The соја, (4) is the product of (2) & (3) cols, 
with their respective figures. Then we compute the mean 
эй 

п. 
eT 110 _ 5 
= 25 593 =25+0 47 


“. X=25°47 years. 


K=A+ 
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Example No. 7 
What is median ? Illustrate by an example, how would 


you calculate the median ? 


Solution: — 
Median: The median is defined as the value of a 


variate which divides the distribution into two equal parts. 
Thus, half of the values lie below the median and half above it. 


Calculation of Median : 
I—(a) Simple series of odd items : 

In a simple series, the variate does not repeat itself. In 
such a series, if the number of items is odd, the computation 


of the median is very easy. 
First we arrange the values either in the ascending or in 


the descending order of their magnitudes and then find the size 
of (=>) item in this series. It will be our desired value 


of the median. 
For illustration , consider the following example of 9 


items with magnitudes 2, 5, 7, 3,11,9, 8, 4 & 6, 
To find the median, the items are arranged in the 


ascending order of their magnitudes i. e. 

2, 3, 4, 6, 6, 7, 8, 9, 11, then size (Уа 
givcs the median, Therefore 

Md =size of (ee item 

=size of 2) item —size of 5 item 
2 
=6 
_ 7. Median=6 


I—(b) Simple series of even items : 
In this case, the median is the simple arithmatic average 


of the size of (=) item and (= +1 ЈУ item when the 


values have been arranged either in the ascending or in the 
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descending order of their magnitudes. For example, take the 
case of finding the median from the data of eight ( even 
number ) values— 
62, 65, 67, 64, 72, 53, 59, 55 
The ascending arrangement is 
58, 55, 59, 62, 64, 65, 67, 72. Then the size of 


п үш. нү ӨБҮШ. io cus —62 
(+) item= size of (+) ic. 4 item 


& size of (441 item=size of 5' item --.64 


Hence, Md.=Simple arithmatic average of the size of 


( ==) Gu i тешз 
— 62464 _ 63 
2 

-. Median =63 
Ungrouped data of discrete variate: 

If a variate x, repeats f, times, x4 repeats Ё, times and so 
on ie. if we deal witha frequency distribution of the 
following type— 


Variate value (x) frequency (£) 
XT f 
X: f 
ха у £ 
x £ „then there аге 


following steps in the calculation of the median. 

(i) Arrange the items cither in ascending or descending 
order of their magnitudes, if they are not so. But usually in 
such a case the data is already arranged and а fresh 
arrangement is rarely required. 

(ii) Compute the cumulative frequencies of the variate 
values. 
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(iii ) locate the median item which is = where 


n 15 the total frequency, 
(iv) Find the value of the median which is the size of 


the (E by item. : 


Consider the example given below to compute the 
median— 

p 1 2 34 4: БӨ 

f:20 15 17 19 18 15 


For the computation ofthe median, we proceed as 
follows— 


x f Cart 

1 20 20 

2 15 35 

3 17 52 

4 19 “71 

5 13 84 4 
6 15 99 


Since the median is the size of the (= y item, 


1 99--1 \th А 
the median here, is the size of ("== = 60"item. 
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By inspecting the col". of cumulative frequencies we note that 
the 50% item falls Opposite the variate value 3 and so the 
median here is 3. 


In case, when the total of all frequencies is an even 
number, the median will be the mean of the size of 


n \th n th. 
(= ) апа (2+1 ) items. 
3—Grouped data of discrete & continuous variate: 


In the case of the grouped data, the arrangement. of the 
items according to their magnitudes is already done. What is 
equired next to be done, is to calculate. 
(i) The cumulative frequencies, 


(ii) To locate the median-class, which is poe c.f. 


(iii) To determine the value of the median by applying 
the formula— 


Md=L+5— x i 


: 1 
where Md— —->median, metti 
L——-~lower limit of the median class 
i——-~class-interval of the median class, 
f——-+frequency of the median class 
с— — cumulative frequency of the 


class following the median-class 
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For illustration, consider the following frequency 
distribution to find out the median— 


class frequency (Е) 


To calculate the median for the above data, first we 
prepare the following table for c.f. 


| class frequency ctf 
(Е) 

| 
0—5 4 4 
5—10 6 10 
10—15 10 20 
15—20 16 36 
20—25 12 48 
25—30 8 56 
30—35 4 60 
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Here, we want the value of ( ae у item, 


which isl ocated in the class 15—20. Then the median is 
determined by the formula— 


Md=L4+" xi where m= = 
30:5 —20 L=15 
—124-3:28 #=16 
`. median— 18:28 approximately. i=15 


Example No. (8) 

What do you mean by the Quartiles ? Calculate the 
median, the upper and the lower quartiles from the data given 
below— 


% of recovery} No. of |% of recovery | Мо. of 
of sugar factories of sugar factories 
on cane on cane 
8.0—8.2 2 | 949.6 10 
8.2—8.4 5 9.6—9.8 7 
8.4—8.6 4 9.8—10.0 6 
8.6—8.8 11 10.0—10.2 3 
8.8—9.0 11 10.2—10.4 1 
9.0—9.2 11 104—106 | 1 
9.2—9.4 13 ; 


Total 
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Solution— 


As the median is such a value of the variate which 
divides the, whole distribution in such a way that half of the 
observations lie below it and the remaining half above it when 
the data is arranged in the ascending or descending order of 
magnitudes. 


Similarly, the lower quartile denoted by Ој is the 
value of the variate which divides the distribution in such a 
manner that one quarter of the observations lie below it and 
and the remaining 3 quarters above it when the values have 
been arranged in the ascending order of their magnitudes. 


The upper quartile denoted by Qs is the value of the 
variate which divides the distribution in such a way that 3 
quarters of the observations lie below it and 1 quarter above it 
when the data have been arranged in the ascending order of 


their magnitudes, 


Clearly О» is the median Qs, Qs and median, are also 
called the partitioning values of the distribution. Other 


partitioning values are Pertiles, Deciles and Hectiles. 


The mathematical formula for the computation of 


quartiles for the grouped data are— 


= оде xi 
where L-- >lower limit of Ој class 
n+l 
Chia Tie 
c—-»c.f. of the class following the О, 


class 


£— frequency of the О; class 
i— class interval of the Qi class 
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and 
= +2 xi 
where q= Se 
L—- lower limit of Оз class 
£- —frequency of Оз class 
i— «lass interval of Оз class 


c— c.f of the class following the Оз 
class 


| Тһе Q; class lies opposite of CX ) c.f. and О; class 


lies opposite о DE For the present example, we have 


the following table— 


% recovery 
of sugur No. of Ma 
on cane factories s 
x (6) 
80—82 2 
8:2—84 5 7 
8-4—8'6 + 11 
8:6—8:8 11 22 
8:8—9'0 11 33 
9:0—9:2 11 44 
9:2—9:4 18 57 
9:4--9:6 10 67 
t 9:6—9:8 7 74 
9:8—10:0 6 80 
| 100—102 3 83 
10:2—10:4 1 84 
10:4—10:6 1 85 
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85+1 _ 
тл 


where m— 


Hence the median classis 9:0—9:2 and so L—9'*0, 
1=0:2, f=11, c--33 


Md== =90 +43 02 


=9:0+0° c 


*. median=9'1818 


The lower quartile Q4 —L-- +a xi 


where озм =21'5 


Hence the lower quartile class is 8°6—8'8 and so 1 =8'6 


i—0:2, f=1], c=11 
200 D x02 


Q,—864- 
=86+0° 1s 
О,= 87818 
The upper quartile کک + ادرو‎ xi 


where q= ее 64:5 


Heace the upper Ne class is 9*4— 9:6 and so 
L—94, i—02, £—10, c=57 


~ 0,—94+ ЕШ x02 


—944-0 i 
· О,=955 
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ExampleNo. (10) 
The classification of 76 cows has been done according to 
one day milk in the following table. Find the median ? 


class interval No. of cows 


(Milk in Ibs) 


:8—10 4 
10—12 | 8 
12—14 12 
14—16 25 
16—18 15 
18—20 8 


20—22 3 
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Solution:— : 


109 


We first calculate the cumulative frequencies which are 


given in the following table— 


class frequency c Sft 
interval (f) 

(x) 
8—10 4 4 
10—12 8 12 
12—14 12 24 
14—16 25 49 
16—18 15 64 
18- 20 8 72 
20—22 3 75 


The median is computed by the formula— 


ма=і+ xi 
where m— E 
Hence the median lies in the class 
L=14, f=25, c—24, i=2 


38—24 
Md=14-+ 925 


—144-112—165:12 
*, Median—15:12 


x2 


nl 75+1 gg 
: = 


14—16 and 


50 
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Example No. 11 

The following table gives the marks obtained by a batch 
of candidates in a certain examination of History and politics, 
In which subject is the level of knowledge of the candidates 
higher ? Give reasons ? 


eS eee ۴ 
Roll History | Politics | Roll History |Politics 


No. No. 

1 42 46 9 40 30 

2 24 20 10 62 61 

3 38 4] 11 55 50 

4 35 43 12 54 63 

5 30 25 13 52 45 

6 45 54 14 47 56 

7 58 47 15 48 58 — 
(M.Sc. 

8 £0 36 | Agra 
1955) 


Solutien:— 


To find out the subject in which the knowledge of 
students is higher, we calculate the medians of the marks 
obtained in the two subjects. In the subject, for which the 
median is higher, the level of knowledge is higher. 

For this, we arrange the marks in the ascending order of 
magnitudes in each subject. 

History : 24, 30, 35, 38, 40, 42, 43, 45, 47, 50, 52, 54 ,55 
58, 62 

Politics : 20, 25, 30, 36, 41, 43, 45, 46, 47, 50, 54, 56, [^ 
61, 6 
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The median will be the size of == ЈЕ item 


—size of 8 item, since n— 15 

Median for History-marks—45 

Median for Politics-marks—4 6 

Hence, the level of knowledge of the candidates is higher 
in Politics than in the history. 

Example No. (12) 

Define Mode ? Illustrate its method of calculation ? 
Solution:— 

Mode: The most frequent or the most popular value 
of a variate is called the mode. It is. repeated at the greatest 
no of times. In popular language, when we speak the average 
stud ent or average rent, we generally imply the modal student 

/ or the modal rent. It is easy to locate, as it lies at the highest 
frequency. But if there are irregularities in the frequency 
distribution. the position of the mode could become indefinite. 
In such cases, the process of grouping will be applicable. In a 
discrete series the size of the variable which has the mix, 
frequency is the mode; while in a continuous series the mode 
will Бе located by interpolation in the modal group by the 
formula 


= A ; 
Мон EA ean 
where L means the lower limit of the modal class 
Ау Stands for the difference between 
the frequency of the modal class and 
that of the class which follows the 
modal class 
Ag——Stands for the difference between the 
frequency of the modal class and 
that of the the class which preceeds 
the modal class. 
j—-+Stands for the class interval of the 
modal class, 
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Let us consider the following age-distribution of the 


+ + Я Я Z | 
candidates appearing at the matriculation examination ОЁ, 
Patna University in 1937— 


Age-group | No. of stud. Agegroup | No. of stud. 


(years) (years) 

2-13 5 Yi—18 980 
13—14 48 1&—19 | өзү 
14—15 189 19—20 794 
15—16 303 20—21 515 
16—17 599 21—22 474. 


Total | | | n 


Here the modal classis 18-19 corresponding to the : 
highest frequency 981 and so, 1 


L—18, i—19-18—1, A,—981-980—1, A,—981— 
794—187 $ 
“. Mo=18+ 1 х1 
|— 140187 
=18+-005=18-005 
` Mode=18-005 years. 
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ExampleNo. (13) 


Define dispersion ? What are the different methods of 
measuring it, describe each briefly г (U.P. Board, 1962) 
Solution:— 


Having been known the value of the average of a series, 
we try to comput some stitistics which can give an idea how 
the values of the variate are scattered around the central value. 
This variation of the data from the central value is called the 
dispersion or Scatter or Spread or variability. 

Thus, the dispersion can be defined as the extent to 
which the magnitudes of the items differ from the central 


value. 


The following are the measures of dispersion— 
Q Range 
(3) Quartile deviation 
(ii) Mean deviation from (a) average (mean) 
) (b) median 


(iv) Standard deviation 


(i) Range: It is defined as the difference between the 
greatest and the least magnitude- of the items. It is very easy to 
calculate but its'use "а а measure of dispersion is very rare. 
Because it does not depend upon all the observations (items) 
and ‘in its computation no consideration is given to the value 


of the сепега! tendency: 


(ii, Quartile deviation : (Semi Inter quartile range) 
It is half of the quartile range i c. 
QD.— 99.21 
Itisa better measure of dispersion than the range and is 


used in elementary descriptive statistics, But due to its 


oftenly : 
ents, it is not used in the 


incapability of algebraic treatm 
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Advance-theory. Another reason is that it gives an idea of 
dispersion only of those items which lie between Q, & О, and 

so most of the items have no effect on the computation of 
dispersion. 


ili) Mean deviation (average deviatson) : As the 
computations of range & quartile deviation do not depend 
upon all the observation, so these measures of dispersion can 
not be said as the satisfactory measures of dispersion. One 
measure free from this objection is the mean deviation which 
is defined as the arithmatic average of the absolute deviations 
of the items from any measure (mean or median) of central 
tendency. Mathematically we write— 


хба 
ae 
where n is the total number of items, 
f—-+Stands for the corresponding 
frequency of the variate x 
апа idi—-- Stands for the absolute 
deviation of x from its average. 


Mean deviation= 


If the deviations are taken from mean, then it is called 
Mean Deviation about mean and if deviations are taken from 
the median, then it is called Mean Deviation obout 
median, The mean deviation also suffers from опе drawback 
that is of incapability of further algebraic treatments. 


For the illustration of the procedure, we calculate the 
mean deviation about mean from the following data— 


x: 25, 75, 195, 175, 22:5, 97:5, 32:5 
pre 4, GF 10; 165 951272 0887 4 
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To compute the mean deviation about the mean, we 
form the following table— 


(1) (2) (3) (4) (5) (6) 
x f fx ==х-М | |d] £|d| 
2:5 4 10:0 —15:5| 15:5 62-0 
7:5 6 45:0 —10:5| 105 630 
12:5 10 125:0 —5.5 5:5 55:0 
17:5 16 280:0 — 0:5 0.5 8:0 
22:5 12 270:0 +45 4:5 54:0 
275 8 220 0 +9:5 9:5 76:0 
325 4 130:0 +145) 14:5 58:0 
60=sf 1080 376:0 
=sfx = зр iil 
xfx 1080 
X زا ا بے لے‎ 
Mean X or M— = 60 


First of all, we calculate the mean X —18, then we take 
the deviations of all values of the variate of сој. (1) from their 
actual mean 18 as shown in сој“ (4), by ‘d’. Again in соја, 
(5), we write the absolute values of ‘d` and in (6)™ the 
product of (2) & (5) with respective figures. 

Mean deviation _ zld] 015 — 61266 

ean 6 
ive : Ene the mean deviation about the 


edian then first we find the median for the data and uo 
a the deviations of the values from this median. d n 
e 
p mean deviation about any  averagecan be foun 
the 
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by £irst computing that desired average for the given data 
and then obtaining the deviations of the values from that 


average. The rest of the Procedure is the same 


as indicated 
above, 


(iv) Standard deviation : Among all the measures 
of dispersion, the S.D. is most widely used because of the facts 
that 
(1) its computation is based on al] the observations, 
(2) the deviotions are taken from the mean and so the | 
- sum of squares of the deviations is minimum, 
(3) ‘tis capable of algebraic treatments, 
The S D. is the Square root of the mean of the squares 
of deviations from the arithmatic mean. Symbolically, the g reek 
letter sigma (в) denotes the siandared deviation, 


(хр SaaS Ase 
= [5% E j| gn GE 
n 


n n 
and o=,/ 3{(X—X)? = жЕ" = y 
n n RET | 


inplace of the actual mean for our convenience. Then the 
formula for S. D. will be 


FA rz El 
n n 
B 1 . 
and o= /3fd* «(y fora frequency distribution, 
n n 4 
For illustration, we summarize the steps in the 


computation of S.D, by taking the following! simple series of 
values— 


X : 192, 288, 236, 229, 184, 230, 348, 291. 330, 243 
(1). Choos the assumed mean ‘A’, A 
(2)- Take the deviations 'd' from this assumed mean, 
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3. Square the deviations i. e. calculate ‘d”’. 


The results of the above steps for the given series are tabulated 
below— 


The S. D. is given by 


= 
et 


= У (2661:49) 
251-59: 


Totals +1230 =й | 


To complete the S. D. їп the case of frequency distribution, fhe 
following steps are made— 

(1) Choose the assumed mean, ‘A’, 

(2) take the deviations d=(X—4) from this assumed mean, 

(3) calculate the squares of d, 

(4) compute the prroducts ( fd) in a column, 

(5) compute the products (fd?) in another column. 


For illustration of the procedure, we calculate the S. D. from 
the following data— 

Height in cms. (X) : 60, 61, 62, 63, 64, 65, 66, 67, 68. 

No. of plants (f) : 2 0, I5, 29, 25, 12, 10, 4, 3. 
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The computational work is donê: іп! the following tabular 
«,Porm— 


A=64 


| 


= 1=  fd| 257—y fd? 


The 5. D. is given by— 


o= j E Ee \_ | fio- (Gan) озо ) 
=V (2:5579) 


51:59 
Computation of the S. D. in the case of frequency distribution 
of the grouped data with unequal Interval. 
In this case, the classes are replaced by their mid-valües and the 
rest procedure remains the same as explained’ above: 
In the case of a grouped data when the class-intervals are equal, 
a more convenient formula for calculating the S. D. is. 


i 
vix (PLE (EL) where ¿is the class-interval and 


d 


ram 
This is known asthe method of step-deviation. 


ү 
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The steps:according to: this:method-are «given below— 

(1) choose the assmed mean ‘A’, 

(2) take the deviations d=(X—A) from this assumed mean, 
(3) divide the deviations by: the class interval say i and call the 


value бо: 


(4) calculate the squares of 2 

(5) compute the products f č, ana 

(6) finally compute-the products ‘f &. 

For illustration, consider the following cxample— 

Age group (X): 20—30, 30—40, 40—50.'50—60, 60—70, 

(Years) 70—80, 80— 90. 

No. of persons (f): 3, 61, 132, 153, 140, 51, 2 

The computational work for S. D.-is shown in the following 
tabular form— 


A-55, =0 
ETT CO SEI Er i 
II у (мее e ле 
| sear | | | [| 
20-30 |25) 13 |-30| —3| —9 27 
30-40  |35| „61 | -20) -2| —122 | 24 
40—50 |45 | 432 |—10| —1| 932 132 
so—60 |55| 1533 | 0 | о 0 0 
60—70 |65| 140 | +10) «1| +140 | м0 
70-80 |75 | 1 | +20 | 42| +162 204 
80—90 |85 | 2 | F30 | JE NE 
Totals —15= ЈЕ] 165= 
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=10х y (1'4114—'0008)=10 x у (1'4106) 
-=10x 1'187 


=11'87 
c—11:87 


Exp. 14. Prove that Xx?— EX?— 


approximately. 
(ЕХ)? 
п 


Where x is the deviation of an observation & from 
the arithmatic mean X of the sample and n is the total 
number of observations in the sample. 


Sol. We have 


(M. Sc. Ag. Agra, 1963) 


x=X—X, where x-P* 


x?=(X—x)? 


and zx*—z(X— X) 


=s[X?+ Xt—2X X 
=X x?-—2 XXX 


=5X?+n X°—2n X? SAXEN 


Hence Proved. 


Exp. 15. Compute the S. D. of the following data by means 


of the formulae— 


| | aa IER y 
2 2 
& ==®/®—(®®) 
| sfe зла) 
and (c) c= " (£ 
! m 19-13. 14, 15, 16, 17.2 187 20 
IE oed 1132,21, 1575-8 5; 4 


| 
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ЕЕЗ : (а) For the computation of the S. D., the table is given 


х=15 | . 
esl [X J|a-x-x) |х | ا‎ 
12 1 5830 а a о ИН 
ТЗ лт] 143 |^ —2 4 44 
14 | 32 448 —1 1 32 
15 21 315 0 | 0 0 
16 15 240 | 1 | 1 15 
17 8 136 | 2 | 
18 | 5 90 | 3 | 
mi 4 ИРА aE 


= 2fX 1500. 
xeu 5 | 
Thus the S. D. is given by 


| ۷9 


д 7-174 approximately. 
(b) To compute the S. D., the following table is framed— 


x У x fX fx 

12^ alee 4 144 48 576 

13 11 169 143 1859 

14 32 196 448 6272 

15 21 225 315 4725 

16 15 256 240 3840 

17 8 289 136 2312 

18 5 324 90 1620 

20 4 400 80 1600 
Eu ПИЕ ааа 

omis |100=х/ | 5 /X=1500 |22,804=2f X? 


jae O e 
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The S. D. is given by 
- ыа. ] 
n n 100 100 
= /(228:04—225 
c—V3:04—1:76 approximately. 


(c) The S. D. is computed with the help of the following table— 


A=16 
x | у a= @ fa E 
ue I EDAD eene e i. cup eres, 
| i 
12 4 =4 16 —16 64 | 
13 11 =з о, || 2:93" КО» 
14 32 => 4 | —64 128 
15 21 =i ied cl 21 
16 П 0 0 0 0 0 
17 8 | 1 І 8 8^ 
18 gu 2 ZU INT TD 
20 re ae 4 | 16 ES 
| > 
н —100— А 
Totals | 100=х/ | | 7,1905 ozs ft 
_ јула (злу 404 [—100 
= М\ т п "A100 100 
=V(4:04—1) 


—V/304 
a= 1°76 ° approximately. 


The calculations ‘have been considerabely reduced by using 


2 
(4 di рл (E and so it is called short cut formula. 
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Exp. 16 Calculate the median and the quartiles using 
(a) mathematical method, and 
(b) graphical method, of the: following data— 


7—9 
9—11 
11—13 
13—15 
15—17 
Seth ЛЕ 


Sol.: First we prepare the cumulative frequency table— 


Class | frepuency ER 
We have 
sem : ~ тате xi 
ei e У where те 24ے‎ 
91i 10 20 So, median lies in the class. 
11—13 
11—13 12 32 p 
ма=1ї+ ха 
13—15 6 38 — 114-066 
1551/7 5 43 Median =11°66 
17—19 


The lower quartile is, 


LES 
Qi— Li fi xi where gt 12 and so, 


ioe CALY 
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zi 2 1t 
q2 4 дар ZY ђе EY {е (=) 
n n n n 


. eer erp) 
n n п п 


2 
=a? . os -b*o,? + ч {®ху—п®ў} 


2аЬ 
=a op boy? = z(x—&)(»—5) 
G= aog + b?c,? + 0 


Because E (x—x) (у—ў)=0 as х and у are independent 


variates. ; 
6,’=a"0,"+-ba,". Hence proved. 
(с) У (x+y)=V (x)+V (у) when x and y are indep. 
—05--12 and a=1, b=1 
zr. 
| S. D. (х++у)= У 7) 
or Stat =1°3 approximately. 
Similarly, V (x—y)=V (x)+V (у), where а=1, b= —1 
—05-4-1:2 
=1°7 


S. D. (x—y)=V (1:7) 
x- =1'3 approximately. 


Also, V (3x—4y)—9x V (x)--16x V (y) 


=9x0°5+16 х 1'2 asa a, 
—45-L-19:2 b=—4 
E09: 


S. D. (3x—4y)— (23:7) 
or (32-4) = 4°86 approximately. 
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Exp. 18. Write notes on : 
(a) Variance, 
(b) Standard error. 
(c) Coefficient of variation. 
Sol. (a) Variance: It is the square of the standard 
deviation i.e. 
x(X—Xxy 
n 
where X —represents the variate values 
X —the mean of the variate values 
n —the total number of items. 


Thus, variance is the arithmatic mean of the squares of the deviations 
from the mean. Jt is denoted by c? and is widely used in the 
statistical analysis of field experiments. 


Variance —(S. D.)?— 


(b) Standard error: It is the standard deviation of any 
statistic calculated on the basis of sample observations. It is widely 
used in the testing of statistical hypotheses. The S. E. of the mean 


is given by 22 where п is the total number of observations, on 
which the mean has been calculated i.e. 
G 
.E-—- 

R vn 
(c) Coefficient of variation : The S. D. and other measures of 
dispersion are the absolute measures of dispersion and are 
expressed in the same units in which the observations are given 
and hence cannot be used to compare the variations in the two 
given series which differ in their units and averages. To compare 
the variations of two such series, we require some relative measure 
of dispersion. The coefficients of variation is such a measure 
which is defined as the ratio of the S. D. to the mean expressed in 


percentage. Symbolicaily, 


g 
== х 100. 
C. V. M x1 


This statistic was developed by Pearson. In order to compare 
the variations in the two series, we compute the coefficients of 
variation for each of them. The series for which the C. V. is 


higher, will vary more than the other. 
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Exp. 19. Тһе scores of two golfers for 10 rounds each аге: 
A: 58, 59, 60, 54, 65, 66, 52, 75, 69, 52 
B: 84, 56, 92, 65, 86, 78, 44, 54, 73, 68 
which may be regarded as the more consistent player ? 
Р Sol. In the present problem, we have to compute the 
coefficients of variation in the two cases and then compare them, 
The one, which has less variability than the other, will be cailed as 
the more consistent player ? 
The computation is made in the following tabular form : 


t | Golter B 


ГЕМ ЕП? HE: Collen 

3 боев xm) ове ("ey (v-) v-v 
rss | —3 | 9 | 84 | +14 196 
2-59 2 4 26. аем 196 
КТАД 1 92 | +22 484 
«ir A a 7 49 65 | —5 25 
ا‎ 67 - | +4 16 86 „| +16 256 
6| «66 | +5 25 78 | +8 64 
a | 8l 44 | —26 676 
sers РВИ 196 54 | —16 256 
9| 6» |-+8 64 B | 43 9 
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=>У(52:6) 


= (216°6) 
=7:252 approx. =14:717 approx. 
С. У х 100= 7222 100 € Vue 1217100 


== 11°88 
Sol, Comparing the two С. Vs 
A is more consistant than player B. 
Exp. 20. (a) Explain the practical utility of the coefficient of 
variation in the Biological research ? 


= 21'024 
, we conclude that the plaver 


(b) The average heights of ten years old and 18 years old 
girls were reported to be 74:4 and 161'0 cms. respectively with 
standard deviations 2:64 and 6°12 cms. respectively. Which data 
varies more ? (M. Sc. Ag. Agra 1963) 

Sol. (a) It has been answered in the above question. 

(b) C. V. for the heights of 10 years old girls is 

C. Va 7 04 x 100 
=3'548. 
C. V. for the heights of 18 years old girls is 


6:12 
Ge Уз=твүХ 100 
=3'801. 
Comparing the two C. Vs., we conclude that the heights of the 


18 years old girls show the greater variability than that of the 10 
years old girls. 


Exp. 21. A random sample from a biological population is 
given below : 


MEME De "> suain * lo МИРО RUE 
Observati 
RENAT] 2037405 67 9 OF 10/10 12813314 915 


No. of grains/ | 
Бела 19 30 11 31 32 36 39 24 12 57 39 29 34 53 33 
x II TII i 


In the above series, sum of all the observations is 479 and sum 
-of squares of deviations from the mean is 2272۰93. 

(a) Calculate the following of the above sample. 

Mean, variance, standard deviation and standard error. 


(b) What is an array ? Illustrate your answer with the series 
en above. 
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(c) In samples of 15-20 items, the range is on the average 
about 3:5 times the standard deviation. Verify this statement from 


the sample given above ? (М. Sc. Ag. Agra, 1964 ) 

Sol. (a) We have m 

n=15 

=X=479 
= (X—X)*=2272:93. 
Hence, the mean (== _ ووو و‎ grains/ear head 
Variance (cx? —: LM 151-5286 
X(X—x) 2272:93 
S. D. 2= | = /|== 3) 


=V (151°5286)= 12:309 
S. E. (of mean) 8:10: = x (eme а! (Brie 5) 


n 


Е =V(10°1019) 
—3'17 approx. 


(b) Array: An orderly arrangement of the variate values is 
called an array. If the variate values i.e, the number of grains/ 
earhead are arranged either in ascending or descending order of 
their magnitudes, the resulting series will be called ап array. The 
practical utility of an array lies in the calculation of partition 
values. 


(c) Inthe given sample, the lowest number of grains per 
earhead is 11 and highest is 57, so the range will be given by : 
R—57—11—46 grains/earhead 
We have the S. D.—12:309 
so 3°5 х8. D.—3:5x 12:309 
=43:08 
which is nearly equal to the range value i.e. 46 grains/earhead, 


Thus, we can Saythat on the average the range is 3:5 times 


Nia efte S, V. Ante: sange is ise. 
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Exp.22. A random sample from a field experiment is given 
below : 


MEL ee PEE 


Obs.No. 12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 
EE i Ru E у. i ES 


Hts, of 
Plants іп |23 17 20 29 18 22 16 25 13 15 19 21 23 20 21 22 16 


cms. 
س ا‎ 


In this sample, the sum of all the observations is 340 and the 
sum of squares of deviations from the mean is 254. 

(a) Calculate six statistical constants, three each of measures 
of type and measures of variability, from the above sample. 


(b) Explain what is an array, with the help ofthe above 
sample. What is its practical utility ? ( M. Sc. Ag. Agra, 1965) 


Sol. (a) We have, 
15:11, 
УХ =340 
= (X—xX)?=254 
The three measures of central tendency are—Mean, Median, 
Harmonie mean (as the Mode cannot be found in the present 
example without the frequencies). 
Then Mean کم‎ 00 cms. 


values (heights 


The median can be found out by arranging the 
aanitudes. The 


of plants in cms ) in ascending order of their m 
arrangement will be as follows : 
13, 15, 16, 16, 17, 18, 19, 20. 20, 21, оТ, 22, 22. 23:32 


Md.=size of ће Gy item= size of eR item 
—size of 9th item. 


3, 25, 29 


The size of the 9th item in the orderly arranged series is 20 


and so the 
Median 20:0 cms- 
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The three averages (measures of central tendency) are found 
to be equal. 


For the three measures of dispersion, we take the range, the 


S. D. and the mean deviation (about:fhe mean) as computed 
below : 


Range (R) —highest value— lowest value 
—29—13—16 cms. 


SED. = ТЕ) i (14942353) 


=3°865 cms, approximately. 


Mean Beviation (M. 55-2141. 


where | 4| means (ће absolute Value of the deviation from the 
actual mean X i.e. ја | = {X-X |. 


Education 


in 


Istical Methods 


Stati 


134 


(ивәш ynoqe) 


*xoidde ‘suro $850. 6 = чопемор ueojq 


TS "CGU ПИЛЕ E s o :- c] Та 


w= € 2 S0 io Тос усе р C= 6 бг Х—Х=р 


ОРЕ 


ЭР Geer ОЕ СОБИТ г стар бр бс uy £c x 


ддд 
ПЕЕ ООО S a EFE S 


5910], 


: 9,92) шморо oq; «Urey О} әле әм “уг enduro ог 


Measures of Central Tendency (averages) and Dispersion 135 


Hence we have, the three calculated — 


Measures of average Measures of dispersion 
Mean —20:0 cms. Range=16 cms. 
Median— 20:0 cms, S. D.—3:865 cms. 
H. M. =19:9 cms, M. D.—3:0588 cms. 


(b) The answer has been given in the previous question. 
Exp.23. Account of the number of grains on each of fourty 
tarheads of wheat gave the following results— 
3212232025 80100826 SP Sule ОЗ 
29. 39 34 26:27 38 27 34 
33 27 29:27 31 38. 3739 
18 34 35 40 34 43 24 31 
28 43 20 33 28 34 29 28 
Present the data in the form ofa frequency distribution, 
рш the mean and standard deviation of the distribution from 
© grouped data and state the standard error of mean number of 
Stains per earhead, (M. Sc. Ag. Agra 1958) 
Sol The lowest magnitude is 17 and highest is 43. Taking 
the lower limit of the Ist class 16, and keeping ihe class-internal 
on 4, we shall obtain the following seven classes with their 
B ive frequencies, The border lines items are placed in the 
Ses Where they are as the lower limits. 


Class Tally marks Frequency 
16—20 || | 3 
20—24 | | | | 4 


A on IU 7 
28—32 NW T 8 
32—36 ММ 10 
36—40 nu 


40—44 ||| 3 
po rs —— 


Ur 
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To compute the mean, S. D. and the standard. error af mean 
number of grains per earhead, we form the following table— 


A=30 
Mid value Х| d=X—A | q? ip ја Ја? 
18 —12 144 3 —36 | 432 
22 —8 64 4 —32 | 256 
26 —4 16 7 =98 | 112 
30 0 0 8 0 0 
34 4 16 10 40 | 160 
E: 8 64 5 40. | 320 
42 12 144 3 36 | 432 
н A +20 |1712 
potas er ninalla D ns eer za 
"^ Mean is given by 
| xA Lfd 
n 
20 
= 30--20 3 
= 80+05 


X=30'5 grains | 
Zfd (ZfdaYvA  Ifiz2 [2015 
зр. (= ЛУ ie |} ЙК; - (6) j 
=V (42:80—0:25)/(42:55). 
S. D.—6:52 approximately, 
S. E. of mean no. of grains/earhead is given by 
| S.D. (652 652 
S. E. Va СУ @УТєзә 103. 
Standard error—1:03 grains, ) 
Mean=30°5 grains | 
S. D. —6:52 grains 
S. E. of mean no. of grains/earhead | 
=1°03 graius ss = "ans. 
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i low 
Exp. 24. The length of 50 earheads of wheat аге given be 
in centimetres— 


104 108 85 109 105 94 111 70 Se ud 
109 11:2 105 97 108 84 93 112 e 
122 89 96 110 104 99 9:0 105 S tae 
11:5 97 106 104 95 84 87 91 no e 
118 78-101 10] 84 110 95 94 


istribution 
(a) Presentthe data-in the formrof a frequency distribu 
by Choosing a suitable class-interval. 


LO Ei d 
(b) Calculate-the mean and median of the ies 
represent it graphically. (M. Sc. Agra, 


Sol; , (a). The lowest magnitude-is 7:0 and een 8 IE 
Taking the lower limit of the first/class 7'0 and koppe е ан 
interval equal to 0:8, we obtain the following seven C s um 
their respective frequencies written” against them. ind 
line items are placed in the classes where? they are as 
limits— 


Class Tally marks Frequency 


70-78 | || = 
78—86 | 5 
86—94 | | 6 
а ОВТ 
102—110 | 13 
ШТ 9 

2 

| 50 


11:0—11:8 


1r8—126 | || 


Totals 50 
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(b) To calculate the mean of the distribution, we form the 
following table— 


Mid 
Value |d=X—A| f 
X 
74 | —24| 2 
82 —1'6 5 Hence the mean is 
E хуа 
90| —08| 6 x- Aq 
| 0 3 { 
9:8 1 "e 120 
10:6 08 | 13 
—9:8-L024 
at 16 z X=10:04 centimetres 
12:2 2:4 2 к 
Totals — 57—50 


For the computation of median of the distribution we prepare 
the following table— 


Class lm c. f 


Thus the median is 


T:0—.:8 
т-с. 
78—86 Md-L4- f xi 
8:6—9:4 __ п+1_504-1 * 
where MO a EDS 5 
9:4—10:2 Я 
Hence the median lies in the f 
10:2—11:0 class 9:4— 10:2 and so | 
КО» L—94, c—13, f=13, i-98 | 


118—126 
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Md-9443 ×8 
—9:4--0:7692 
—10:1692 
Mdæ 10:17 centimetres. 
Graphically the median=10'2 cms. and it is shown in the 
adjacent graph. 


2 units 


V. S. 
c уста фу. 


2,0} BE a | ver Ire WO 126 
H. S. 
— — 1 small div.—0'2 unit 
Graph No. 2 


E Exp. 25. (a) In a series of 75 items, the otal of all'the items 

el found to be 114:55 and their sum» of squares 1757125. 
alculate the.mean and the standard deviation? 

^: (b) For the marks obtained by 75 students in а certain test, 
e sum of the deviation from the assumed mean 17:5 was 330 


a 
nd the sum of squares of the deviations was 6250. Calculete the 
mean and the standard deviation ? 


Sol (a) Weare given n—75 
Ух= 11455 
Exe = 17577125. 


р __Бх_114'55 a. 
i. Mean= x eS =1:527 
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ок 
EU DUROS _ — (1:527)? \ 


=V (2° ml 2:3327) 
=\/(0:0102). 
в —0:101 approximately. 
(b) Уећауе, п=75 
A=17°5 
54—330 
zd — 6250 
Mean=A 434 


= 17:5 350 


Mean= 17:54-4:4— 21:9. 


И 
- Jen] 


= V (83:3333— 19:36)= V (63:9733). 
2—8'0 approximately. 
Exp. 26. How many types of frequency functions do you 
know ; sketch their graphs also ? 
Sol Following are the types of 
distributions : 


() Symmetrical Unimodal curves : This type of the curve 
is symmetrical about its maximum ordinate. The class-frequencies 


simple frequency 


X| 

< 

= 

SYMMETRICAL UNIMO DAL 
CURVE 


CRDINATE 


Fig. 1 


um — тне. т Ol m an dcin рибе" = 
A—À—— ———;"— 


80 оп decreasing symmetrically to zero on both the sides of the 
Central (maximum) ordinate. A very important of this type is the 
normal curve. 


(2) Skewed Unimodal curves : In this type of curves, the 
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frequencies fall more rapidly on one side of the maximum ordinate 


POSITIVELY SKEWED CURVE 


Fig. 2 (a) а 
than that of the other. Such curves have а tail on. one side of the 
max. ordinate. If the tail [ 
is on the right hand side 
| Of the max. ordinate,the 
| Curve is siad to be the 
-Fvely Skewed. 
When the tail is on 
the left hand side of the Fig. 2 (b) 
| max. ordinate the curve is said —vely Skewed curves. 


(3) J-Shaped curves : In this type of distributions, tho 
| maximum frequency falls on one end of the distribution; such as in 


NEGATIVELY SK 


' y 
+ ушу ie 
‘O-SHAPED Ur 

| 5нарер 
f Curve 

1 

| / 

/ 
4 


= 
обе === 


4 - x 


Fig. 3 в 
the bank-balances and other income-distributions, They may be 
+vely, or —vely J-shaped, 
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.(4) U-shaped or anti- y 
modal curves: In this 
type of curves, the fre- 
quencies start from a maxi- 
mum value and then fall 
to a minimum and again 
increase. These curves 
may be symmetrical or may 
not bz. Such type of == Ч 
distributions occur in the О SYM U-SHAPED dates 
number of unemployed 
persons by age-groups. Fig. 4 (a) 


У 


x, 


ASYM.U.- SHAPED CURVE 
Fig. 4 (b) 


А hid 
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EXERCISE No..() A 

(Problems on Ch. 1, H, HD 

" О. 1. The yield of rice in 55 equal plots of a village is given 

In maunds as follows— 

62 59- 23. 27 77 33 430 18. 475 8932 
63 24 17 72 30 45 42 $8 26 36 21 
25 13 64.37 12 485 5 29 41. 40' ob 
35 67 36 18 28 92. 92. 2.20 55-27 

65 52 15 21 78 34 40 16 57 81 31 Prepare a 


histogram, frequency polygon and the frequency curve. 
(U. P. Board, 1964) 
‚ 0.2. (a) What isa frequency distribution ? What conside- 
rations are involved! in forming a ir:quency table froma body of 
Observational data on a quantitative character ? 
( M. Sc. Ag. Agra, 1956) 
(6) Make а freqcency table having grades of wages with class- 
intervals of five annas each from the following data of daily wages 
received by 40 labourers in a certain factory— 
5, 15, 30,22, 30). 25,40, 10 
6: 20= 15" 25° 32 22. 20" 10 
ene 20030" 22° TIL IS 
Cais eile 22) 87 33.9254 35 
MOS O "15120 37 20 42 З 
Q.3. Draw the ogive of the ‘classified data obtained from 
Q. No. (2) (b) and find out median & quartilies from it, graphically ? . 
_ Q.3. In the following table, the heights of 2 plants are given - 
In centimeteres at various ages— 


Age | Height of plant — 
e) А іпст. | B in em. 
40 153 156 
60 | 167 M3 | 
80 | 182 | 180 | Draw the graphs for the 
| | growth of the two plants and 
100 | 187 185 | comment on their growth. 
120 | 198 | 195 
140 201 200 
160 219 219 
180 220 231 
200 20 1.23 
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О. 5. (a) Represent the following data by a suitable 
diagram — 


: Е 
Dose of nitrogen Height of plants in cms, after 


in Kgm./Hectre 30 days 70 days | 
А o 


0 28 52 
22 35 64 
44 40 75 
66 38 67 
70 37 63 


(b) The following table gives the total outlay on rural 
development proposed in the first five Years plan and its break 
downinto major items. Give a suitable diagram to Tepresent the data— 


Amount (in 


Item crores of 
rupees) || 
Agr. & community development 371:43 
Irrigation... Er Ves T. sss 178:97 
Irrigation and power Уз? 21 i5 276:90 
Power а us b se sis 138:54 
Transport and communication... гав 508:10 
Industry... n " ... ... 184:04 
Social Services 35081 
Rehabilitation 96:00 
Miscellaneous Dr S is = 62:99 
| 


Тога! 


| 2167:78 
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Q.6. The consumer's 


price-index and 


its 


three most 


important component indexes for a country from 1935 to 1944 are 
given in the table below : 


oco ——Á —— —— M Á———— 


Year SE | Food | Clothing Rent 
1935 98:1 100:4 96:8 94:2 
1936 99۰1 101:3 976 96:4 
1937 102:7 105:3 | 102:8 100 9 
1938 100:8 97:8 | 102:2 104:1 
1939 99:4 95:2 i 1005 104:3 
1940 100-2 966 | 1017 104:6 
1941 105:2 105:5 106:3 106:2 
1942 116:5 123:9 1242 108:5 
1943 123°6 15570 1297 108:0 
1944 125:5 1361 1388 | 108:2 
| 


Ма ИЕ УРИНА ПИ MM 


Using graphical-method, comment particularly in respect of 
the periods 1937-38, 1939-40 and 1941-44. 


(М. Sc. Ag. Eco. Agra, 1965) 


Q. 7. The following data gives the marks obtained by a batch 
of candidates in a certain examination in Mathematics and 
Statistics. In which subject is the level of knowledge of the candi- 
dates higher ? Give reasons ? 
Marks in Maths. : 
Marks in Stat. : 


14, 16, 16, 14, 22, 13, 15, 24, 12, 23, 14, 20, 17 
21, 22, 18, 18, 19, 20, 17, 16, 15, 11, 12, 21, 26 


Q. 8. (a) Define the median and its utility ? Calculate the 
median from the following data : 


Class Interval 


Freguency 


42 


9-25 


25-41 


o m 
Ши ЕГ 
= ~ 
+ e 
47 40 


73-89 


89-105 
105-121 


N 
= 
(27 


121-137 


~ 


са су 
n © 
a d 
ف ل‎ 
e о 
= = 
25" 52 
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(b) Also compute the quartiles and the quart; 


viation from 
the above data. (M. Sc. Ag. Ec 


igra, 1965) 


О. 9. The following table gives the yield of wheat from 10 
equal plots : 

Plot No. 1-2 3 4 8 6 7 8 9 10 Total 

Yield inkgms. 60 40 50 45 60 55 65 50 65 55 | 45 

If the area of each plot is 242 square yards, find (Ае average 
yield per acre ? Also calculate the coefficient of variation (1 acre 
=4840 sq. yds.). (U. P. Board 1963) 

О. 10. The following table gives the marks obtained by some 
students. Calculate the arithmatic-mean and the mode : 

Marks 0-10 10-20 20-30 30-40 40-s0 

Frequency 3 13 18 12 5 

(О. P. Board, 1963) 


Q: 11. An analysis of the month 
in two firms A and B belon 
follówing results : 


ly wages paid to the workers 
ging to the same indusiry, "gives the 


Firm A Firm B 
No. of wage earners : 586 648 
Average monthly wages : Rs. 52:5 Rs. 47°5 
Var. of the distr. of wage ; 100 121 


(a) Which firni pays out the larger amount as monthly wages 
tothe workers ? 
(b) In which firm А or B, is there greater var 


ibility. in 
individual wages ? , 


(c) What is the measure of average monthly w. 


ages of all the 
workers in the two firms A and B taken t 


ogether ? (7. 4. S, 1951) 


Q. iz. Following is the distribution of marks 


secured by 139 
candidale:. Calculate the S. D. 


and the coefficient of variation ? 


© e [=] [=] e e e 
Marks (x) JR. CE Ди Це ee Un Uy 

= E E Se e 
Frequency (f) 5 10 20 40 30 20 10 4 139 


Measures of Central Tendency (averages) and Dispersion 147 


О. 13. Write short notes on the following : 

(i) Classification, 

(ii) Histogram, 

(iii) Ogive, 

(iv) Frequency polygon and frequency curve, 

(v) Median and Mode. 

Q. 14. The scores of two golfers for 10 rounds each are : 


A: 9, 17, 14, 13, 15,0105 175 13, 13 
Bi: б 15 Тр tive У 9, 14 
Which player may be regarded as more consistent ? | 


^Y 


Q. 15. The following table gives the heights and. dry weights 
of 17 plants. Calculate the mean and S. D, for both the characters 
and find which of these two varies more 2 


Heights in ems. (x) : 

50, 44, 54, 46. 64. 37 17.62, 45, 69, 44, 57,63, 6%, 45, 11, 50 
Weights in ams. (y) : 

4d, o5 44, 26, 41, 21, 60, 52, 15, 55, 42,58, 37, 36, 40, 98 36 


О. 16. In any two series, where d, and d; represent the 
deviations from the assumed means A,=50. А,= 154 have m=10, 
m=, Zd,— 5:8, 2d? 155:66, 20d, =3, Xdj—ssWT0. Calculate 
the coeMicients of variation for the two series 7 
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Q.2. (b) Class Freq. 
5-10 Т 
10-15 
15-20 
20-25 
25-30 
30-35 
35-40 
40-45 


The boundary line cases are kept in the classes where they 
are as lower imits, 


Q. 7. Statistics ; (median is higher). 
О. 8. (a) М4.—53:08. 
(b) Q,—31:916, Q.— 71:390, О. D.—22:737 

Q.9. 1090 Kgms./acre, C. V.—1447. 
Q. 10. A, M.—25:59, Mo.— 24:545, 

. 11. (a) B, (b) B, (c) 49:9 Rs. 
12. S. D.—15:6, C. V2 > 
B. 
15. &—55:65 cms., 3—40:59 gms., S. D. (х)=11:71 cms., 


S. D. (y)—13:45 gms. weights vary more. 
16. C. V..=17:1, C. V..=50°7. 


NUAR OUA 


o оооо 
T 


Chapter IV 


Correlation and Regression 


Meaning of correlation : In all the previous chapters, we haye 
dealt with elementary statistical analysis involving only a single 
measurement (variate X). Many statistical problems arise, in 
which the sample consists of pairs of measurements. For example, 
consider the behaviour of two variables, volume and pressure of а 
gas at a fixed temperature. In this case, we observe the relation 
that the pressure decreases as the volume increases, and if the 
pressure increases the volume decreases. 


Another example may be taken as the behaviour of. volume of 
a gas at a fixed pressure, In this case, the volume increases with 
the increase in the temperature and with the decrease of 
temperature, the volume also decreases. 


In the former example, the changes in the two variables are 
in the opposite directions and in the later case, the changes are 
in the same direction. There isa number of phenomena of this 
type in exact and social sciences, where it is observed that the 
change in one variable is on an average accompanied by а change 
in the another variable, either in the same or in the opposite 
direction. The variables, which show such type of relations are 


said to be correlated, and the phenomenon itself is called 
“Correlation”. 


Definition: The two variables are said to be correlated when 
the change in one variable is on an average accompanied by s 
change in the other variable, in the same or opposite directions, | 

Direction of correlation : The correlation is said to be positive 
vr negative accordingly as the changes in the two variables аА 
the same or reverse directions. Д 

For example, the heights and weights of a number of persons 
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show the +ve correlation. Another examples of +-ve correlations 
are— 


the radius and circumference of a circle, dividend declared by 
a company and the market price of its share, the total cultivable 
area and the area under wheat etc. On the other hand, the price 
and demand of a certain commodity, area under fodder and non- 
fodder crops of a locality, the total income and the proportion of 
it spent on food, are the examples of —ve correlations. 


Degree of correlation : There are three degrees of correlation, 
namely perfect, limited and absence. The correlation will be 
perfect when the ratio of the changes in the two variables is the 
same. The radius and circumference of a circle show the {ve 
perfect correlation. There are some instances where the correlation 
does not exist as in the case of height of a growing tree and that 
ofa house, since the change in one is not accompanied by the 
change in the other. The degree of correlation may be limited. 
In this case, the changes in the two variables are not in a definite 
ratio or the percentage of change in one may not be equal to the 
percentage change in the other. For example, a 10% decrease 
it price may result in a 15% rise in demand for a certain 
commodity. 


Thus, when the percentage changes in the two variables are 
the same, the correlation is perfect and when the percentage 
changes are not equal, the correlation is limited, According to 
the direction and degree of correlation, it may be— 

(1) perfect positive, 

(2) limited degree of positive, 

(3) no correlation, 

(3) Шпкед degree of negative and 
(5) perfect negative. 

1. Perfect positive correlation : The correlation will be 
perfectly --ve if an increase (or decrease) in one variable is 
accompanied by an increase (or decrease) in a perfectly definite 
ratio in the other variable. The radius and the circumference 
ofacircle, the volume and the temperature ofa gas at a fixed 
pressure, are the examples of perfect +-уе correlation. 

2. Limited degree of +ve correlation: If the increase (or 
decrease) inone variable is onan average accompanied by an 
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increase (or decrease) in the other variable, but not necessarily in 
the same ratio, it is called limited degree of ve correlation. Since, 
as the total cultivable area increases, the area under wheat also 
increases but not necessarily in the same ratio. 


3. No correlation: The two variables are said to be 
uncorrelated when the change in one does not effect the other. 
As an example of this type is the number of radio sets produced 
and the total number of births recorded during a certain period. 


4 Limited degree of —ve correlation: If the increase (or 
decrease) in one variable is on an average accompanied by a 
decrease (or inbreasc) in the other variable, but not necessarily in 
the some ratio, it is called the limited degree of negative 
correlation. The area under the fodder and non-fodder crops will 
be an example of the limited degree of —уе correlation. Since, 
as the area under fodder crops increases, that of non-fodder crops 
fall but not necessarily in the same ratio. 


5. Perfect —ve correlation: The correlation will be perfectly 
— ve, if an increase (or decrease) in one variable is accompanied 
by a decrease (or increase) in the other variable in a definite ratio. 
The two variables, volume and pressure of a gas at a fixed 
temperature show the perfect — ve correlation. 


Measures of correlation : 
(1) The scatter diagram, 
(2) Pearsons's coefficient of correlatio,n 
(3) The regression line. 

1. The scatter diagram: It is a graphic method of 
determining the degree and direction of the correlation. The 
method lies in plotting the (х, y) points on the graph paper. 
Merely an eye-inspection of the scatter-diagram (the diagram 
obtained by these points) is sufficient to decide the degree and 
direction of the correlation. 1f— 

(i) all the points on the scatter diagram lie on a straight line 
with +ve slope (starting from the lower left hand corner 
tothe upper right hand corner), the correlation will be 
perfect -+ve (as shown in fig. A), 

(ii) all the points on a scatter diagram lie along a straight 

line with — ve slope (i. e. the line starting from the upper 
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left hand corner to the lower right hand corner) the 
Correlation is perfect —ye (as shown in fig. B.), 


(iii) the points on a Scatter diagram seem to cluster about the 
diagonal with уе Slope, the correlation will be said as 
limited degree of 4-е correlation (as shown ia fig. C.), 

(iv) the poihts on a scatt 
diagonal with — ve 
limited degree of 


er diagram seem to cluster about the 
slope, the correlation will be said as 
— Ye correlation (as shown in fig. D), 


(у) the points on a scatter diagram take roughly the elliptical 


orthe circular Shape, then the two variables are said 
uncorrelated (as shown in the fig. E), 


(vi) the scatter diagram take: 
Correlation will be curvilin 
in the cases (i)-(v) the c 
are concerned with line 


5 the shape of a curve, the 
ear (as shown in fig. F), while 
orrelation is linear and here we 
аг correlation only, 
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Exp. 1. Point out the shortcomings of scatter diagram in 
measuring the correlation ? Suggest a measure free from these 
shortcomings ? Illustrate, how it is computed ? 


Sol. The scatter diagram gives a clear idea about the 
direction of the correlation but as regards the degree of correlation, 
it can only tell us, whether the correlation is perfect or limited. 
In the case of limited degree of correlation, we required to know 
the extent of correlation to which a scatter diagram fails to answer. 
Moreover, the study through diagrams depends upon the accuracy 
and skill of the observer which is an uncertain factor. Hence our 
study and the conclusion drawn from it, may not be a reliable one. 
Due to this reason, we are in need of some arithmatical 
description which can provide a pure number free from the units 
in which the variables have been expressed and may indicate the 
direction of correlation also. The value of such measure should 
be —1 for perfect — ve correlation, +1 for perfect +ve correlation, 
and zero for non-correlation. The —ve values lying between 
0 and —1 will indicate the limited degree of —ve correlation and 
+ve values lying between 0 and +1 will indicate the limited 
degree of +-ve correlation, One such a measure is the coefficient 

. of correlation, which is also called Pearson's coefficient of 
Correlation or the product moment correlation coefficient. This 
Pearson's coefficient. of correlation is generally denoted by ‘r’ and 
is given bythe relation 

Ж COURS Ms 39) where cov.(X, yc ZG XO 

ох.оу 


i.e. the mean of the products of the deviations of the variates 


X and Y from their respective means X and үле 


ox=S. D. of X= JEE”) 


and сұ==8. D. оү | 2?) 


oc LBS Ier sa 
Hence, - (ES) pum 
= n 
POSEE ње 
VIE X-X )). У, (1—7) 
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1 If the deviations are taken from the assumed means, the 
‚ formula takes the form— 


Z2 Ep—Z Е ул{п 


Ee Ae} 


Where 2,7 аге the deviations ofthe variates x andon from 
their respective assumed means A, and Ay. 


This formula is of great importance when the deviations from 
the actual means аге fractional. In the computation of 
correlation coefficient, the main Steps are given below:— 

(a) (i) choose an assumed mean A, for the variate Xe 
(ii) obtain the GRE E—X— A,, 
(iii) calculate £ 


(b) (i) choose the assumed mean A, for the variate Y, 
(ii) obtain the deviations ?— Y — Ay 
(iii) calculate the squares of 7 
(c) (i) compute the praduct £y, 
For the illustration of this method, we compute r $n the 
following examples— 


Exp.(2): The percent carotene content of wheat (X) and” the 


percent carotene content of flour (Y) for 10 varieties of wlieat 
were found to be as follows— 


X: 1.18 2.13 1.41 1.42 1.50 1.25 1.65 1.24 1,48 1,35 


Y: 2.39 3.11 2.15 1.96 2.02 1.76 2.10 2.12 2.28 1.86 


Find the correlation coefficient for X and Y ? 
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Sol.: To compute the correllation coefficient for X and Y, 
we prepare the following table— 


A,=1.50 A,=2.02 
Variet ; 
My xX |-хА | E y h-v,| т | 8 


l.-.| 1218 | —.32 | 0.1024 | 2.39 | +-37 0.1369 |—0.1184 


2. | 2313 | +.63 | 0.3969 | 3.11 | +1.09 | 1.1881 +0.6867 
3, | 1.41 | —.09 | 0.0081 | 2.15 | +.18 0.0169 |—0.0117 
4, |1.42| —.08 |0.0064 | 1.96 —.06 | 0.0036 |+0.0048 
5 1.50 0 0 2.02 0 0 0 

6 1.52 | —.25 0.0625 | 1.76 | —-26 0.0676 |4-0.0650 
7. |1.65| +.15 |0.0225 | 2.10 | +-08 | 0.0064 +0,0120 
gy | 1.24 | —.26 | 0.0676 | 2.12 | +.10 | 0 —0.0260 
9. |1.48| _-02 | 0.0004 | 2.28 | +-26 | 0.0676 —0.0052 


40.0240 


= 
о 
= 
~ 
un 
Eh 
л 
© 
2 
S 
~ 
Un 
oc 
fon) 
| 
ES 
o 
о 
hs] 
ч 
с 


Тога п—10 |—.39 (0.6893 т=10 1.55. [1.5227 0.6312 
SENE EEq (28. = 7 


EA ла пик заара а 


Now the correlation coefficient is given by the formula— 


хт 22. 
=e] 
39x 1°55 


0.68312—— dg —- 


Jp eo- С WI 1527. 1:55) ] 


0.63120- .06045 
= (0.67409) V/(1 . 28245) 


= 
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0.69165 0.69165 
1(0-8644867205) 7 0.92977 + 0+ 7348 
r—--0.7348. 


(2) Pearsos's coefficient of correlation : 


Exp.(3): In determining volume of red cells in defibrinated 
beef blood by a certain method, the following . results. were- 
obtained— 


Percent 
of whole| 
blood in| 
mixture 
(X) 


= 


00100 |100 | 90 | 90 | 90 | 80 | 80 | 80 | 50 | 50 | 50 | 


Percent 
of volume; 
pocupied 40: 544: 5146-035- 540: 5/41 "013201360136: 5/20: 022-023- 
y re 
zells (У) 


——————————— ———————————— 


Find the product moment coefficient for X and Y ? 


Sol.: The computation of product moment coefficient is 
made in the following tabular form— 
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A,—80 A,=40°5 
S.No. X Z£-XA, Ё Y p-Y-A,| 7" En 
1. | 100 | +20 | 400 | 40-5! 0 0 0 
2. | 100 +20 | 400 | 44:5) +4°0 16-00 | +8000 
3. |100 | +20 | 400 | 460 | 45:5! 30:25 |11000 
4. | 90 +10 100 355 | —5:0| 25:00 —50°00 
5 90 +10 100 | 40'5 0 0 0 
6 90 +10 100 41-0 | -0:5 | 00°25 +5°00 
7. | 80 0 o | 32-0} —8:5 | 72°25 0 
8.. | 80 0 o |360| —45| 20°25 0 
9 80 0 0 36:5 | —4°0 16-00 0. 
10. | 50 —30 900 20:0 | —20°5 420°25 |+615°00 
11. | 50 —30 | 900 | 22-0) —18:5 342-24 +555°00 
12. | 50 —30 900 23-0 | —17:5| 5 +525°00 
Totalsn—12 | o=% | 4200 |п=12 | —68' 5 11248-74 |+1840 00 
=e =й =57? == 
O И __--__|- —— و‎ 
Hence we have | 
ze 2-2 
тре [9] 
1840— beste 6s 5) 
ЫЫ. | | RYE 3) 
1840 1840 
ОЕ. 7192) 4/(3602420. 64) 1898. yrs ogee 


с. gem 40.97. 
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Ex.(4) A research worker observed the following fresh 
weights and dry weights in a sample of his experimental material— 

Observation No. 1 2 3 4 5 6 7 8 9 10 

Fresh wt. (oz) SO IONS I22 O15 (14918 

Dry wt. (oz) ЗА ОНО АНИ S 4 34 

(a) Calculate the correlation between the two characters and 
interpret them ? 

(b) Represent the data in a scatter diagram ? 

(M. Sc. Ag. Agra 1963) 


Sol. : 


(a) Taking fresh weights by X and dry weights by Y, 
we prepare the table— 


A =11 А,=3 
| 
Ne X |Е=ХА, үр Y М=ҮА, 7? m 
fis e OX AS | 9 Зи CORE MO 0 
о OS | 55 zu ! 5 
| ду | 7 am il) al / + 
TG 36 Dal ا‎ 1 +6 
она AT ecc ql ЧА 
бал] 2 - 9 — gi ако +18 
UM 204 Ee ат 5 +2 |4 +18 
ED, +4 | 1674 47 | sema а yt 
зо 3 ОДА NO 9 
ПАЈА У ТЕЛА РЕДА ag A crash. Faal 297 
Totals n—10 | 0—2 a à п=10 | 0—Z» м-р 60=>7 


o E ERE Н n т” 
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Now we have— 


*i EE rp Z] 


ون 


E 10 m 
(0)* (0 
АЕ | 
C MN o 0 30 0.9137 
V(QG08)/(14) - 14x 4/22 7x4.69 32.83 
s: 5—-1-0:9137. 

Interpretation: Тһе two characters show а high degree 
of correlation. It means that there is nearly a linear 
relationship between the fresh weights and the dry weights of the 
sample. 

(b) The scatter diagram for the fresh and dry weights : 

In the scatter diagram, the points (X, Y) cluster around the 
diagonal;with ve slope, which shows a high degree of -- ve correlation 
in the fresh and dry weights. 

Scatter Diagram 


V.S.— ——1"—2 (oz.) dry wt. 


($9 5 to 45 
H.S.—1 small div.— 1(02.) Fresh wt. 
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Exp.(5): Show that the coefficient af correlation lies 
between —1 and +1. (M. Sc. Agra, 1948, M. Sc. Ag. Agra 1965) 
Sol.: Consider the quantity 
S = ]د‎ к=) је Since the sum of 
squares is always Ve. 
Where X, Y are the means of the variates X and Y respectively 
and cx, cy their S. Ds. 


or > 2 поени А CDC YE 


2n ox? сү? ox Gy 
—x)?9 —y)? — У XJ 
s= | 20050:  za-v» 20Ү У о 
n.ox? п.ү n.GX6y 
or i[I-J-1—2r]z0 
or 1—rz0 
XN UR XN а-а ои LENS (i) 


Further, taking the quantity 
|! X-X) , (Уу) 
s=. 2(%2 У += | > 
Which gives us 


1+r >0 
e уљане! ШЕИ ИШ Ls ote (ii) 
Combining the results obtained in (1) and (2) we get, 
—lI&r«-l. Hence proved. 


Exp. (6): Define rank correlation? Illustrate, how will 
you complete it with the help of the following example — 

Marks in mathematics (X) : 80 75, 72, 72, 70, 68, 62, 50 

Marks in Music (Y) : ТО 12217, 17717, 22,20, 23 

Sol. 


Def.: The correlation between the grades (ranks) of the 
variables is called ‘rank correlation’. 


It is computed in the circumstances when we want to know 
the relationship between the proficiencies of a group of candidates 
in the two different subjects A and B or we wish to investigate 
thé degree of agreement between the two judges who have graded 
the same individuals regarding the same characteristic. The rank 


correlation is given by the following formula [due to spearman] 
буа? 
т=1— прау where d—represents the difference between the 
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ranks of the same individual, 
n—the number of observations. 

Method of ranking: First go through the series carefully and 
note. the item with highest magnitude. This will be assigned 
rank 1, the next lower rank 2, and so on, ...... , the item with 
lowest magnitude will be assigned the highest rank i.e. equal to 
the number of observations. 


Consider the series— 

78, 36, 25, 98, 75, where the 4th item has the highest magnitude 
go its rank will be 1, next lower item is 78 so its rank will be 
2,then comes 75 with rank 3, 36 with rank 4 and 25 the lowest 
value with rank 5 (the highest rank). Now we summarize the 


result as below 
Measurement (X) : 78, 36, 25, 98, 75 


Rank (): 2, 4 5 1 3 


[n the case when two or more items of (X) are equel, the 
rank awarded to each of them will be the A, M. of the ranks that 
they would have received if they had differed by a small quantity. 


For this, consider the series— 
81, 78, 73, 69, 73, 78, 62, 73 


Let us first suppose that all the values are different, then the 
rank assigned to Ist item is 1, (being highest value), to second 
item is 2, to sixth item is 3, to third item is 4, to fifth item is 5, 
to, eighth item is 6, to fourth item is 7 and to the seventh item 
is,8. Here 78 has appeared twice, so the rank assigned to 78 


will к= i.e. 25, 37 also appeared thrice so its rank will be 
OOS, Now we summarize the result as follows— 


Measurement (X) : 81, 78, 78. 69, 73, 78, 62, 73 


Actual rank (r) : 195025 S ENS 225: EG 
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The computation of the rank correlation is made as follows— 


k 
X | Rankofx | у [Rank ory КИК, „| а 
80 1 10 8 7 | 49-00 
75 2 12 7 5:00 | 25-00 
72 3:5 17 5 1°50 | 2-25 
72 3:5 17 5 | 1-5 2:25 
70 5 | 17 5 0 0 
68 6 22 2 —4 16-00 
62 7 20 а Ма» Уа 16:00 
50 8 23 МА Е |а 
Totals — | — = = 159-50 
ne oe LERNEN =2@* 


Coefficient of rank Correlation= ] — CODE 
n(n?— Т) 
—1— 6% 159.50 _ 157 
Fi 8(64—1) = 504 
* r=—0.898, the high— ve value shows that a student best in 
Maths is worst in music, 
Exp. (7): The ranks of 15 candidates in a be 
ranked by two judges were as follows— 


(The two numbers within brackets denoting the r 
same candidate) 


(6, 8), (7, 3), (8, 1), (9, 11), (10, 15), (11, 9), (12, 5), (13, 14), 
(14, 12), (15, 13), (1, 10), (2, 7), (3,2), (4, 6), (5, 4). Show, that 
the two judges do not differ in their opinions ? 

Sol. The computation of 
follows— 


4=2, —4, —7, 2, 5, кол Л e EY 5 с 
d=4, 16, 49, 4, 25, 4, 49. l, 4, 4, 81, 25, 1,4, 1 
2d?=272, п=15 


ашу context 


anks of the 


the rank Correlation is made as 


——, 
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22 6 уа? 
` п(и—1) 
_ _6x272 
15 (225—1) 
2151162 
Е) 3360 

'. r=+0.5142 

The +ve value of the rank correlation coefficient shows 
that the judges do not differ in their opinions, 


Él 


(3) Regression: The correlation theory deals with the 
relationship exhibited by two variables i. e. whether the change in 
one variable tends to bring the change in the other in the same 
or inverse direction. When such relation is exhibited by 
the two variables, then it is also intended to calculate the amount 
of change that will on an average take place in the other variable 
when a unit change has occured in one. This average change is 
given by the regression-coefficient. 


Using the regression coefficient, we can give the expected 
value (average value) of a variate corresponding to a fixed value 
ofthe other variate. The variate which is kept fixed is called the 
independent variable and the other (which is to be estimated) is 
called dependent variable. Of the two variables, we choose that 
variable as independent variable whose effect produces the 
correlation. For example, consider the rainfall and yield of rice 
which are often correlated. The correlation between the two is the 
result of the influence of rainfall on yield of rice and not the result 
of the influence of yield of rice on rainfall. Hence the yield of rice 

"will be taken as the dependent variable and the rainfall as the 
independent variable. Sometimes both the variables may take 
both ofthe roles. It happens when the correlation between them 
is the result of the influence of a third factor. For example, the +-ve 
correlation between the yield of rice/acre and of jute/acre is due 
to the fact that the two are related to the amount of rainfall. Thus 
the regression concept is concerned with the way in which the 
changes in one variable depend upon the simultaneous changes in 
the other variable (called the independent variable). In general, if 
Y is the dependent and X the independent factor, the recorded 
value of Y for any value of X will show the ordinary variation and 
thus it will cover a range-readings. The regression function is that 
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which expresses the average value (that might be expected) of one 
variable for any given value of the independent factor, 

We are concerned with the linear regression only i.e. when 
the regression function represents a straight ‘line called the 


regression line, The regression line of Y on X is defined by the 
equation : 


enviar a (Xx) | eee (i) 
ox 


where x and Y are the means of X and Y variables respectively and 
ox, cy their standard deviations ; r is the coefficient of correlation 
between the two variables, 
Similarly, the regression ~ of X on У may be 
XL = sy- Wits t PEEL (ii) 
In equations (i) and (ii), the! Variables on the rilht hand Side 
are the independent variablés “and those on the LHS. are the 


dependent. The тејаноп (iy is üSed to estirlàte Y Corresponding to 
agiven value of X and (ii) is Wed to^ “estimate the vàlüe 'of'X Гога 


given value © Y. We can replace r Su by бүх і іп Wd and r 2X by 
x сү 


сҮ 
byx =r E 
7 [е ст 


deem зүү 


TT ya E] 
ser 
where £ | | 


5,7 are the елдйн of Х'апа Y '{гбт "their assuined 


means, n —— —- is thé no. of observations. 
Similarly, 
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f Exp. 8. (a) Give the relation between the fegression 
coefficients and the correlation coefficient ? Also ‘give te 
difference between them ? 

(b) Point out the difference between totrelation and regression 
theory ? 

(c) How the regression lines give an idea of the correlation ? 

(d) What'is the point of intersection’ of the 2 fegression lines ? 


Sol (a) The corrélatión coefficient is the "ОМ. of the 
regression coefficients. As we Know that 
ey, 


byy =r 
ox 


П с, 
апа byy —r —. 


: „ су ©; З 
Hence, byx .bxy =r. St 
Ox OY 


r=V(byx . bxy )- 


The correlation coefficient is a measure of direction and extent 

‘of correlation between the two variables. ‘It indicates whether the 
‘change in one variable tends to bring a change in the other 
‘variable in the same or reverse direction. While the regression 
"coefficient gives the average change in one variable corresponding 
‘toa unit change in the other variable. ‘Thus yx gives-an average 
‘change in Y for a unit change in X. 

(b) The fundamental difference between the correlation and 
regression theory is that both the variables are considered as 
random in the former case while inthe later, one variable is 
independent and the other is dependent. 

(c) When the two regression lines coincide, the correlation 
is perfect and when they cut at rightangle, the correlation is zero. 
If the two lines diverge and intersect each Other, the correlation 
is limited. As the divergence between the two regression lines 
increases, the correlation decreases. 

(d) The two regression lines intersect at the point (X. Y) i.e. 
they pass through the means. 

Note. byx=tan 6, where @ is the angle which the regression 

lines of Y on X'makes with the X-axis. 

and bxy=tan ¢, where Ф is the angle which ihe regression 

line of X on Y-makes with the Y-axis. 
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Exp. No.9. (a) Find the two regression lines from the 
follewing data :— | v 
(b) Plotthelines on graph and give an idea 
of tr 7 


Fresh wts. (gms.) (x): 25, 27, 26, 29, 34, 35, 
Dry wts. (ems) (y: 2, 5, 7, 9, 19, 17, 


Sol.: (a) For thé computation of the two regression lines, 
we prepare the following table : 


4,—20 A,—10 
سس‎ ——— ————— 
X |t-X—A,| ж Y |-Y—A, 7? én 
25 —5 25 2 —8 64 40 
DYN 236 9 5 
26 EP. 16 7 
29 = 1 9 
34 +4 16 19 
3: +5 2s teg 
Totals | —4=у® |92—yz* | n—6 
Кеке a ЧУ Ырак MSIE 
у, х 


¥=A,+2— 104.5) 10—0:17—9:83 


Eén Em 139 9х =) 


n 
__139—0:67 _ 138-33 ove 
92—267 8933 — PDIOX: 
zo 1m 139- €-9 CD 
and bxy-— „== 


2)? = 
TS 


— 139—0:67_ 138-33 rs 
229—017 22883 — approx. 
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T herefore, the regression line of Y on X, will be 
Y—Y=byx (Х—Х) 


or Y —9:83—1:55 (X—29:33) T ы) 
and that of X on Y will be 
X—X —bxy (ҮҮ) " 
or X—29:33—0-60 (Y—9'83) АО ces) 
Hence the regression lines are Y=1'55X— 35°63 
and X=0:60Y +23:43 
(b) To plot the graphs of the regression lines 
Y—9:83—1:55 (X—29:33 "s 5-3 +--(i) 
and X—29:33—060(Y—983) = = <. (ii) 


we need at least two points for each. One of the points will be 
Mx(X, v)z:(29:33, 9:83) 
as the lines always pass through the point (X, Y). Fer anether 
point of equation (i). 
Put X=0 in (i), to get 
i Y=35°63. 
Hence the point say A=(0. 35°63). 
If we join M and A, we get the graph of equation (i). 
Similarly, put Y=0 in (ii), to get 
X=23°43. 
Hence the point say B=(23°43, 0). 
If we join M and B, we get the graph of equation (ii). 
Graphs for Regression Lines 


V. S. Small div.—2 units 


H.S.>1 Small div.—2 units - 


The gra: ў 
зе graphs of the two regression li imi 
of positive correlati ne bet СЕ x RT show the limited degree 
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The significance of correlation coefficient: In the case of 
large samples, (i. е. п > 100), to test the “significance of the 
` Correlation coefficient, we compute tbe statistic 
r = دال‎ 
2= 53 X V/(n— 1), since S. Е. (r)= Vai) 
which is a S N; V; 


Hence the. Properties. of S.N. У. сап be used to determine 
whether a correlation Coefficient is. significantly different from zero 
Or not. 


For illustration, consider the following example— 


` Exp. 10. A coefficient of correlation of 0:22 is obtained from 
а sample of 401 observations. Is this correlation significant, ? 


0:22 
Sol, -=_-_== _ 
Мо 20 
_ 44 
09516" 
z=4°6 approx., which is greater than 1:96 the value of 
S. N. V. at 5 percent. 


Hence the correlation coefficient is significantly different 
from zero at 5 percent level of significance. 


In the сазе of small samples (i, е. п < 100), the statistic 


ica У@—1) 


does not conform. to the normal or other standard form. 
Therefore, we find a simple function of ‘r’ which conforms to some 
standard form. For this, the function of “r is 


bet 
Уа У e-2. 
which is found to obey the ‘r° distribution with, (n—2) d. f. 
For 2=0 (population correlation coefficient), whatever be the 
size of the sample, we always calculate the statistic 
r 
t= س‎ „У (п=2), 
vary У 


and use the properties of ‘/ distribution. 


Saar <a 
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The significance of. the regression coefficient: To test the 


significance of Бух» we compute the statistic 


Бух 


аас ене ше COLE POGUE торша соц 


coefficient of regression, 


S S DE SSE 
and 5..Е. of (b,x) "i DSS, 3 


МИ 
ог CENT 
where S. S. xe © 


of X from assumed mean ia 


,&is the deviation 


or S. 5. —Xx*, where х=(Х— X) 
i. e. deviation from actual mean X. 
сз (Bn)? : s 
Similarly, S. EL NETT 7 is the deviation of Y from 


assumed mean Ay 


or. S. S. —Xy5 where 


y=(Y-¥) 
i. e. deviation from the actual 
mean Y, 

where ay’ =V (ey! — P3 d ox"). 


This statlstic conforms to 47 distribution with (n— 2) d. f. 
Similarly, to test the the significance of byy, we compute the 
statistic 


fart 
CS E. of byy 
where S. E. of b. e ERES SOS 
@-2)S.S, 
MEOS МЕ. 
2 «v (—2).o, 


where | оу = У(ох — bx -ay?). 
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Exp. 11. The thickness of 20 annual rings of a tree and the 
corresponding annudl'rainfall^were found to be correlated with а 
coefficient of +0°47. - Is this correlation significant 2" ^ ' ^ *^ 

Sol. We have r=0:47 ^ 

| n—20. 

So we compute (ће statistic. 


td zy» Vn 2). 
0:47 à 
очту) У 
0:47 
BE LO Vs: 


ENTE AES ZY 

zo ax (обот) 
—0:47 х ү(23:103581) 
=0°47 x 4:806 

—2:25896. | 

1222/26, 1.05(18)= 27101. 


Conelüsion': This calculated value of ‘1’ is greater than the 
tabulated value.of ? at 5 percent level of significance for 18 d. f. 
Hence the correlation coefficient is significant at 5 percent level. 


2n 


~ ——Á — 
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Exp. (12) : The following data are for the amount of water. 
applied in inches and yield of alfalfa in ‘tons/acre— 

Water (X) : 12, 'I8, ‘24; 30, 36; 42, 48 

Yield (Y) : 5:27, 5°68; 625, 7:21, 820, 8°67, 8:42. Find 
the regression of yield on water? Assuming that the relation 
between the two is linear, calculate: the expected yield when the 
amount applied is 20 inches., . ( M. Sc. Ag. Agra, 1959) 


50]. : Here the,water.isindependent variable and the yield 
is dependent variable and so they are represented, by X and Y 


respectively. To compute the regression line, the following table 
is prepared— . [ [ 
— _____________ 
| Е 
X |E—X—A, E Y^| т=у—А, | m en 
A=30 rh pes A=7'08: 
12 — 18 324 5*27 | —1-81 3°2761 32°58 
ely. aie 144 "^ 5:68 | —1-40 | 1-9600.| 16-80 
ane . . itor 
24° i6. 36 6:25 | —0°83 0*6889 4*98 
| « 
30 0 0 7:21 +0°13 0:1690 0 
a5) 6 | 36. | 8:20: |.4-1:12 1.2544 |. 6°72 
T | 
42 +12 А 144 8-67 +1°59 2°5281 19-08 
48 +180177 384. | gran | 4:1:34 17956 | 244i) 
| 33 
Totals 0—2X£ |1008=у°| n=7 | +0:14= 57 [11-6721 | 104-28 
<= суус" UN 
== 2; ! 
X—A bot =30+ 730i da 
y=A,+ T 08+ 1_7. 08-1-0: 02=7°10 tons. 
The regression of yield on water.is given by the equation — 
Supe 2820 
Y—Y-byx(X--X) whereibggesl-s: __104:28—0 


E NOB 1008—0 
n 
=0°1035 
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: Y —7:10—0-1035(X — 30). 
or Y=0:1035X+ 3-995 
For X=20", the expected yield will be 
Ү.=Ү--Бух(20— 30) 
ог Y,— 7-10--0:1035 х 10 
or Ү,=7:1004-1:035 
Я Y,=8'135 tons/aere, 
Thus the most likely yield of alfalfa per acre when water is 
20" is 8:135 tons. 


Exp. (13): Measurement of height of father (X inches) and 
of sons (Y inches) on thirty pairs of fathers and sons gave the 
following results to the nearest whole numbers, 


X—67 Y=68, S(X—x)*=225, S(Y—Y)?=256, 
S(X—xX)(Y—¥)=125 


Calculate the regression of the height of son on the height of 
the father and predict the mean height of sons of fathers whose 


height is 70 inches ? (M. Sc. Ag. Agra, 1956) 
Sol : We have n=30, 27222025, Бу'=256, Zxy—125, 
X=67, Y —68 


The regression of the height of sons (Y) on the height of 
fathers (X) is given by 


Y—Y=byx(X—X), where b,x (000—7) Z xy 


SX—xX9 = а 
125020 
ог Y —68—0'56(X —67) 
or Y=0°56X3-+-0°48 


To predict the mean height of sons of fa 
70 inches, we have 


Y,—Y--b, (X— 9) 
or Y,—68--0*56(70—67) 


x Y,=684-1°68=69°68 inches, It is the expected mean 
wight of sons of those fathers whore height is 70”. Ans. 


thers whose height is 
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Exp. (14): From 15 plots of cotton:the-number of plants per 
plot (X) and the corresponding yield (Y).were recorded and the 
following quantities obtained— 


S(X—xy—1756. — s(Y—Y?—79:6 
S(X—x)v—v)— 102-4, Y=11-4. Y=64 
Calculate the regression of yield on the no. of plants. and test 
its significance... What will be the equation of the- regression line 7 
(M. Sc. Ag. Agra, 1960)... 
Sol.: Here we take the yield (Y) as the- dependent variable 


and the no. of plants/plot (X) as the independent variable. To 
find the regression of yield on the no. of plants/plot, we compute 
E OX) (X У) Xy 810274; o r (Cx 

eA Sp E ITA ja HORIS di 


“y=(Y—¥) 


=0'583 
The regression line is given by— 
Y—Y=byx(X—X) 


or Y—64—0:583(X—11:4) 
or Y —6:4--0:583X— 6:6462 
or Y—0:583X—0:2462 
To test the significance of byx, we take the null hypothesis— 


Ho: That its population value is zero i. e. В=0 


Now we compute the statistic 
К byx—p Per S.S. У — p2 ух5. ss) 
E SIE of bys" where S. Е. of bes Jf -(n—2) S. 8, 
_ [[19:6— (583): Es) 
X ` 13x1756 
— /(79:6—0:3399 x 175:6 
S. E. of by = A XE 


- (26599866) (1921386) 
. 22828 ) af \ 228987 ) 
=v (-0009)=-03 approx. 


POSU 1943). шу эй 


Conclusion : The calculated value of; is greater than the 
tabulated value of ‘2’ at 5 percent level for 13 d. f. 


Hence the regression coefficient b yx is significant. 
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Exp. 15; -The- following: table: gives the temperature’ of 
anhusked rice and the percentage breakage of fice grain 10 
milling— 


Temp. (degree) : 33 34 29. 35 38 28 29 36 34 30 
% breakage : 37 26 26 27 30 24 25 28 30 24 


Calculate the- coefficient” of correlation between the 
temperature and percentage breakage and test its significance ? 
State. your conclusions 7 (M. Sc. Ag. Agra; 1961) 


Sok Taking. (X) for temperature (in degree) and (Y) for the 
percentage breakage of rice in milling, we compute the following 
table for the calculation of the coefficient of correlation— 


Ax=33 EE 
SKN. X |t-X—Ax| Е ү 1-Y-Ay| т | ën 
Џ | 

ES 0 0-| 37 | +10 100 0 
25534 | +1 iL. | и ess me |) 
3 | j9 | 4 16 | 26 | мађ Id || ee 
4 | 35 +2 4 2 0 0 0 
5 | 38 +5: 25 | дода 5 9 |15 
6 | | —5 2:90 БРНА is 9 |015 
7 | 29 —4 165: | 25 =) 4 | +48 
8 | 36 +3 ООВ il |) 403 
о | 34 +1 1 | 30 43 e ||| es) 
10 30 —3 9 24 —3 9 | +9 

us [cU : | 

Totals 0—10) —4=уё_| 106: „10 47259 | 143 | 56 


5 2 
= orrelation:and: Regression 25 
. Hence we compute, 


27 n= 


ЕБЕ 


z (4)(7) 
3 56 DE 


JG e jy do m) 


r 


$. A ва OB Sous 5 Ва oo 
ба (1381) /(01441764) 12007 
4 r=+0-49 


To test the significance of ‘r’, we take the null hypothesis— 
Ho: that the population correlation coefficient e— 0 


0:49 
“yoa VS 
х 210049 RO (Б) 5 
= VOB 0¥ 8=0 49x 05-8799 —0:49x:3:014 
. t=14769 ‚апа 1 (8)=2°306 
“05 


Conclusion : Thus the calculated value of ‘r’ is less than the 
tabulated value of 47? at 5% level of significance for 8 d. f. and we 
conclude that the ©” is not significant. It means that the 


temperature (in degree) and % breakage of rice in milling are 
uncorrelated. е 


Exp. 16. From observations of no. of bolls (X) and yield of 


‘seed cotton (Y)-«aken on 50 cotton- plants, the following quantities 
»were calculated — honig у 


X =80, "S (X—x )2= 1225°0 
"NY == 25 рта. 75 (Y —y-j2—441:0 gm? 
S (X—X (Y— Y )—451:0.gm. 5 
Calculate the regression of yield оп :boll number 'and-test.its 


significance ? Predict the mean vield per plant of the plants 
bearing 35 bolls, What would 


be the regression of boll number o 
H > + H n 
yield ? What is the precise pbysical meaning of two regression 
Coefficients ? (M. Sc. Ag. Agra 1958) 
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Sol. The regression of yield on boll number is given by 
bee S OE OQ -= үдә 36812037 
The regression of boll number on yield is given by, 
bee S (X-X )Y—Yv), 451 
> 5(“—у ) 441 
The regression line of yield on the number of bolls is— 
SP У —byx (X-X) 
or =25+0°37 (X—30) 
or xm 0:37X--13:9 - 
The mean yield per plant for the plant bearing 35 bolls will 
be given by— 
1 Ye=0:37x 35-++13°9 
or Ye--12:95--13:9 
or Ye=26°85 gm. 
To test the significance of byx, we take the null hypothesis— 


1:0226zz1:02 


Ho: thatthe value of the population regression coefficient 
is zero (i. e. 6-0). 
Then we compus the statistic, 


byx— S. Sy—byx S.Sx 
ТЕ. of Ba ——, where 5. E. of byx=, | (SE SSx (n—2) SS. 


E 1 (037) x 1225 
^ COST 


_ f 2713:2975 
= (- 58800 - ) 
—4/(0:004647— 0:068 
_ 037—0 | 


E Es L2 
= T068 5:44 mui (48)— 2-01 


Result: Thus the calculated value of ‘£’ is greater than the 
tabulated value of ‘1’ at 5% level of significance for 48 d. f. and we 
conclude that the regression of yield on boll number is significant. 


The coefficient of regression of Y on X and of X on Y give 
estimates of changes of Y and of X respectively for the 
corresponding unit changes in X and Y. 


Бі 2 
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Exp. 17. What is linear regression ? What principle is 
employed in obtaining the estimate of ‘b’ constants in a regression 
equation ? 

For two characters X and Y, it is found that 

r=+0°3 , сх=4 
х'—=25 | сү=5 
Ма—22/ 
Calculate (i) the expected value of X when Y=12 
(ii) the expected value of Y when X—33 
( M. Sc. Ag. Agra 1962) 
(iii) test the significance of byy for n—11. 

Sol. When the points on the scatter-diagram seem to cluster 
about a straight line, it suggests some linear relationship between 
the variables and this st. line is called the line of regression. It 
gives the average value of the dependent variable corresponding to 
à given value of the independent variable. When both the variables 
wake the role of ‘independent variable, we have two regression 
ines— 

(i) the regression line of Y on X: which gives the average 

value of Y for a given value of X. 
sro (ii) the regression line of X on Y: which givés the average 
value of X for a given value of Y. 

In general, the two lines аге different except in the case when 
the correlation is perfect. : 

„Further, Let Y=a+hX, be the equation of the line o^ 
regression of Y on X, where a and b are the constants of regressioni 
To estimate these constants a and b. the principle of least squarer 
is employed. : | 

The P. L. S. consists in finding the sum of squares of the 
Y from its estimated value and then minimizing it with respect to 
aand b: "The estimates of a and b thus obtained are— 

a-(— Y. x) 
9x / 
cov (X, Y) с 
уро м 
бү 


This estimated value b—r-2Y. is called the regression- 


о 


2 


coefficient of Y on X апа is written more precisely by the symbol] 


"yx. It gives the average change in Y for a unit change in X. 


The regression line of Y on X is given by 


Y—Y —byx(X— X ),where byx=r-S¥ L08x 5 


Bx a DO 
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or Y—22=1:0 (X—25) 
or Y—224-X—25 


or У=Х—3 
The expected value of Y when X=33, is given by 
(ii) Ye=33—3=30 s. Ye=30 


Similarly, the regression line of X on Y is given by 
X—X —bxy(Y — Y ),where bxy=r =08 x 3-06 


‘or X—25=0:64 (Y—22) 
ог X—25--0:64Y — 14:08 
or X—0'64Y --10:92 
j The expected value of X when Y = 12, is given by 
(i) Хе=0:64х 12+10°92=7°68+ 10°92= 18°50 
Хе=18:60 
Hence we have Y—X—3, Х==0:64Ү-- 10:92 
Ye—30, Хе= 18:60 Апе, 


We have computed the regression coefficient byx=1°0. To 
test the significance of Буу, we take the null hyp. 


~ Ho.: That the population value (B) of the regression 
coefficient is zero. 


For this we compute the statistic, 


Бух 6 s 
سا کا‎ JE: A ET ML 
Sau А lx qe WES 
and oly =V (oy 3 Буу. пу) 
_10—0 
= =V[25—(1)2x 16] 
792 
тае і EE 
A ENS обр оуд 2 Û 257 
But 1.95(9)— 2:262. 


Result: The calculated value of ¥ is greater than the 


tabulated value of ‘t at 5% level for 9 d. f. Hence we conclude 
that the regression coefficient by, is significant. 
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map. (18): Ina partially destroyed laboratory record of the 
correlation analysis of data, the following results only are legible : 
Variance of X —9'0, 
Regression lines, 8X—10Y +66=0 
and 40X—18Y=214 
Supply (i) the mean values of X and Y, 
(ii) the S. D. of Y, 
and (iii) the correlation coefficient between X and Y. 
(M. Sc. F. 1963) 
Sol. (i) We know that the two regression lines intersect at 
the point (X, Y). Hence, the means can be found out by solving 
the two regression equations : i 
8x—10Y=— 66 
40x—18y —214 
40x—50y-— — 330 
40x —18y—214 
—32y=—544 or Y=17 
and X-—13 


or subtracting we get 


(iii) Among the two equations, we are not certain which one 
of them is the regression of Y on X and the regression of X on Y. 
There may be two possib ilities : 

(a) 8X —10Y 4-66—0 may be the regression line of X on У. 
or 8X—10Y —66 » » » E > 
and  40X—18Y =214 may be the regression line of Y on X. 
or 187 =40Х —214 к. 


» » » » 


Then by- D and bas 


a 10 40 m 
So byy. Бух == хт > 1, which is not possible. 


(b) 8X —10Y 4-66 —0 may be the regression line of Y on X 


or 10Y —8X4-66 55 » » » » 
and  40X—18Y—214 тау be the regression line of X on Y 
or 40X=18Y +214. » » » » » 


From these regression lines, we have 


8 1 
byx=75 and bxy ду. 
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8 18. К : 
Hence byx-bxy=75 X ду 0:36 < 1, hence this set of regression 
equations is possible. 


Now US 
or r= 3-076, 
But we take r— 40'6 as the sign of the two regression coefficients 
are the same. 


Hence the correlation between X and У is + ve. 


(ii) We know that byxer E 


ox 
T 8 A. ay г 
ог 1079 6 == Е: 
8x3 f 
10x06 ^ 
4-6» 


Hence we get, 


# 
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EXERCISE 5. 

О. (1) Define the correlation coefficient ? Find е 
correlation coefficient for the following values of X and Y— 

cq c Оа 7 20 4202; 111244 8625 

ES Ба И ЗОТ зри д5: а зае Н ДО; "420 а 145 

Q. (2) What are the regression lines ? Give their point of 
intersection, . 

From the following data, calculate (а) the regression 
coefficient of monthly yields of guinea grass on age of grass, (b) the 
regression line of monthly yields of guinea grass on age of grass ? 
Age (X) : 16 2, 3, 4, 5; 6, 7 8 
Monthly yield (Y) : 85, 73, 65, 41, 20, 36, 16, 24 

Q.3. Give the relation between the correlation coefficient 
and the regression coeflicients ? 

The following table gives the heights and dry weights of 17 
plants, calculate 

(i) the cozfficient of correlation, 
(ii) the regression coefficient of weights on heights, 
and (iii) the regression line of weights on heights. 

Height 

(X inches): 50 44 54 46 64 37 77 62 45 69 44 57 63 68 45 71 50 
dry wts. 
(Y gms.): 44 25 44 26 41 21 60 52 15 55 42 58 37 36 40 58 36 

Q. (4) What do you understand by perfect positive and 
perfect negative correlation? Is a significance test needed for this 
value? The correlation coefficient between days to head and 
days to mature for 25 varieties was found to b2 0:4, is this 
significant ? [M. Sc. Ag. Agra, 1962] 

О. (5) Explain the term regression equation? Why should 
there be two regression equations 7 Write дома the two regression 
equations that may be associated with the folloving pairs of 
values— 

X: 152, 114, 138, 154, 144, 153, 141, 117, 156 
Y: 193, 300, 414, 594, 676, 549, 320, 483, 481 (1. А. S. 1951) 

О. 6. Тһе ranks of the same 16 students in two subjects А 
and B were as follows—(Two numbers within brackets denote the 
ranks of the students in A and B respectively) 


(L1) (2,10), (3,3), (44), (5,5), (6,7), (7,2), (8,6) (9,8) 
(10, 11), (11, 15) (12,9) (13,14) (14,12) (15,16) (16, 13) у w. 
Calculate the rank correlation for proficiencie 


: : Sof this grou 
in subject A and B 2 Yost, 


ГМ. Sc. Agra, 1952] 
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Q.7. Astudy of wheat prices at Hapur and Karanchi 
yields the following data— 


Hapur Karanchi 
Average price/maund : 2:463 27197 
Standard deviation : 0:326 0:207 


The coefficient of correlation between two prices is +0:774. 
Estimate from the above data, the most likely price of 
wheat — 
(i) at Hapur, corresponding to the price of Rs. 2'334 at 
Karenchi. 
(ii) at Karanchi, corresponding to the price of Rs. 3:052 at 
Hapur. 
Q.8. For two characters X and Y, 
X=30, ү=7:143, x (X—x)?=1008, 
Z(Y-Y—1L017, E(X—xX)(Y—Y)—1020, . 
calculate the coefficient of regression. of Y on X and test its 
significance ? 
Q.9. (a) Find the coefficient of correlation between X and 
Y, and their means from the following regression lines : 
10Y —16X—21—0 
and 2Y— 5X+16=0 


(b) Plot the lines on the graph and give your view about the 
correlation ? 


(c) From the graph obtain X, Y and compare them with 
those of (a). 


О. 10. Find the most likely price in Bombay corresponding 
to the price of Rs. 70 at Calcutta from the following data : ' 


Average price at Calcutta 65 Rs. 
5: » ,و„‎ Bombay 67 Rs. 
S. D. at Calcutta 2:5 Rs. 
» . Bombay SERS! 


Coefficient of correlation is +0-8 between the t 


) wo prices of 
(ie commodity in the two cities. 


(М. A. Agra, 1954) 
Q. 11. From the following data, find ош the: two regression 
coefficients ? 


Xb——1, 351—229; y= —4, 7—92, vy£g—139, 
n=6, where É, т} are the deviations from the assumed 
means. 


Also test the significance of the coefficients 1 
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O. 12. (a) In the calculation of regression coefficient for 
18 pairs of observations, it was found that byx —1:12 and S. E. 
фух=0-06. Is the estimate significantly different from the expected 
value 1:0. . (Hint : take B=1°0) 
(b) For 8 pairs of observations of X and Y, the following were 
calculated : 
=X= 360, XY—36, УХУ—1210, xX?— 20988, 
DY?=204. 
Find out bxy and test its significance ? 
Q. 13. The following table enumerates the marks obtained 
by a class of students in statistics in 1st. and 2nd. paper : 
Marks in Ist paper (X) :. 
80, 45, 55, 56, 58, 60, 65, 68, 70, 75, 85. 
Marks in 2nd paper (Y) : 
82, 56, 50, 48, 60, 62. 64, 65,70, 74, 90. 


Calculate (i) The regression coefficients ? 
(B. Sc. Agra, 1956) 
(ii) Obtain the scatter diagram and comment on 
Correlation between X and Y. 
Q.14. A sample of yaired variates is given below : 
АОИ 53 CES 
а DUE IUS 6 3 4 35 11 
„  () Calculate the correlation" between the two variables and 
Interpret the same ? 
(b) Illustrate the scatter of the points in a diagram ? 
(M. Sc. Ag. Agra, 1965) 
(c) In a sample of 20 pairs of values, total solids and fat 
contents gave the Correlation coefficient of 0:6. What would you 
conclude from this ? (M. Sc. Ag. Agra, 1959) 


u) 
(2) 
(3) 


(4) 
(5) 
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ANSWERS 
r=+0°98 
(а) byx=—9°762, (b) У—45=—9'762 (X—4'5) 
(i) r=0°7306 (ii) бух=0'8394 
(iii) Y—40°59=0°8394 (X— 55°65) 
t=5:23 
Y —444:45—2:408 (X— 141) 


and Х—141=0'025 (Y —444'45) 


(6) 
(7) 
(8) 
(9) 
(10) 
(11) 
(12) 
(13) 
(14) 


r=0'8 ; 
(i) 1:899 Rs. (ii) 3:086 Rs. 

byx —07101, t=732 approx; 

(a) r—--0:8, X=11:22 approx., Y 20:05 
У =72'6 Rs. 

byx=0°60, bxy=1°55, !=7'5, !=7'75. 
(a) t=2, (b) bxy=—9°762, (PSPs. 

(i) byx=0°99, (ii) bxy=0°85. 

(a) r—-4-0:98, (с). £—3:18. 


Appendix I & II 
Tables & A. U. Papers 


TS e o -—-- 


*£— Table (values at 5% and 1% levels) 


Probability: 


EDUCATIONAL PSYCHOLOGY QUESTIONS 
Occurred in А. U. B. T. EXAMINATIONS 


l. Discuss the meaning and importánce of measures of 
central tendency and dispersion, with special reference to their 
use in the handling of a class. (A. U. B. T., 1960) 


2. Calculate coefficient of correlation from the data given 
below by Spearman's rank method and interpret the result :— 


Marks in Eng. Marks in Maths. 
A о 22 32 
В 27 39 
С 28 40 
D 15 20 
E 32 21 
E 14 21 
G 7 16 
H 9 35 
I d 5 48 
J 9 16 


(A. U. B. Т., 1961) 


3. Explain fully the meaning and importance of measures of 
central tendency and dispersion, with special reference to their 
use in the handling оЃа class. (A. U. B. T., 1962) 


4. The following are the scores in English and Mathematics 
Of ten students. Calculate the coefficient of correlation and 
Comment upon the result, 
ABCDEFGHIJ 
English ... 25 27 30 40 37 15 14 60 80°10 
Mathematies. ... 24 29 35 60 45 10 13: 50 70 21 


(A. U. B. T., 1962) 


1 


5. What is standard deviation ? 


Calculate the standard 
deviation in the following distri bution :— 


Units Frequency 
09 E 10 
- 10-19 а Ба 20 
20-29 20 
30-39 50 
40-49 ; 40 
50-59 15 
60-69 ... 4 


(A. U. B. T., 1963) 


‘Coefficient of Correlation’ ? 
ween English and Mathematics 
ks of eight students in a class— 


Mathematics 


6. What do you understand by 
Calculate the degree of closeness bet 
in the following distribution of mar 


English 


Hommugaw» 
5 


(A. U. B. T., 1964) 


7. The IQ and scores on handwriting test of 25 pupils are 


given below. Calculate the coefficient of Correlation between 
them and interpret the result. Use rank-difference method. 
Scores on handwriting test :— 75, 58, 55, 50, 40, 62, 57, 53, 
49, 38, 61, 57, 53, 48, 36, 59, 56, 51, 47, 35, 59, 56, 50, 46 and -4. 
10:—118, 108, 107, 102, 100, 122, IS, MIS! 100, 97, 119, 
112, 112, 103, 93, 110, 109, 109, 95, 89, 110, 108, 108, 98 and 90. 


(A. U. B. T., 1959) 
Of assessing perso- 
nd reliability, 

(A. U. B. T., 1960) 
*? How is the 
ough them ? What 


ucational purposes ? 
(^. U. B. T., 1961) 


8. Discuss the more ; 


mportant method 
nality, with special reference to their validity a 


9. What are group tests of intelligenc 
intelligence of a group of children assessed thr 
use can the school make of their results for ed 
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